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CHAPTER  1 


I  INTRODUCTION 

^  The  design  of  optimal  controllers  for  linear  stochastic  systems 
requires  an  accurate  description  of  the  system.  However,  the  construction 
of  an  accurate  model  of  real  systems  is  often  not  possible.  These 
inaccuracies  can  stem  from  the  fact  that  the  adopted  linear  model  may  be 
only  a  first-order  approximation  of  a  nonlinear  system.  Also,  there  may  be 
actual  uncertainty  in  the  parameters  of  the  real  system.  This  type  of 
uncertainty  can  arise,  for  instance,  if  one  wishes  to  design  a  single  type 
of  controller  for  a  large  number  of  similar  systems,  when,  for  example,  the 
system  is  mass  produced  and  it  is  impractical  to  tune  each  controller  to 
each  system.  Another  situation  where  this  type  of  uncertainty  can  arise 
is  when  the  parameters  of  a  single  system  vary  slowly  over  long  periods  of 
time,  perhaps  due  to  wear  or  changes  in  the  environment.  These  uncertainties 
in  this  thesis,  are  grouped  into  a  vector  of  parameters.  We  consider 
^several  ways  of  handling  these  .uncertainties  -La  the  design  process*  ' 

'  -'.rr  f  -  ’  .  '  y  -•  .•/. ,  v  .  '  /»•»  , 

Often,  it  is  possible  to  assign's  prior  distribution  td  the--' 

parameter  vector.  Then  by  considering  the  unknown  parameter  vector  as  a  new 
set  of  system  states,  a  nonlinear  stochastic  control  problem  can  be  formed. 
However,  this  type  of  problem  may  not  be  desirable,  especially  if  the 
uncertainties  were  created  by  linearization  of  a  nonlinear  system  in  the 
first  place.  In  this  case,  one  may  wish  to  build  a  controller  that  exhibits 
adequate  performance  for  all  values  of  the  parameter,  perhaps  based  on  the 
weight  each  value  of  the  parameter  receives  from  the  distribution. 


T 


V  ■  -'F  3  *T  C7  * ’TILT*  *_"*  ^ST  V'  ".*T  T  VTTTTT  VL'I'.TT  L"  ^  V  ■  -  "  7  <■  x  w  v  *  -  -  v  -  r  —s  -  V  -  -  -  TT  -  * 
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Another  method  is  to  assume  that  the  parameters  vary  within  a 
certain  range  about  a  nominal  value.  The  use  of  an  optimal  controller  for 
the  nominal  value  may  result  in  instabilities  or  poor  performance  for  off- 
nominal  values  of  the  parameter.  To  design  controllers  for  these  systems, 
some  performance  must  be  sacrificed  to  desensitize  the  controller. 

A  third  method  of  handling  these  uncertainties  is  to  assume  that 
the  parameters  are  unknown  within  a  given  set.  This  set  may  be  compact, 
or  have  a  finite  number  of  values.  In  many  circumstances,  one  may  design 
an  adaptive  controller  that  identifies  the  unknown  parameter  and  tunes 
itself  to  the  identified  model.  However,  in  some  circumstances  this 
identification  scheme  is  not  practical,  if,  for  instance  the  parameters 
are  slowly  varying,  or  if  the  parameters  jump  to  different  values  at  unknown 
instances  in  time.  In  such  cases,  the  identification  process  may  not  have 
time  to  converge  before  a  new  identification  needs  to  be  made.  Then,  it  may  be 
desirable  to  design  a  controller  that  does  not  identify  the  parameter,  but  has 
acceptable  performance  for  all  the  values  within  the  given  set. 

Any  one  of  these  methods  of  handling  uncertainties  in  parameters, 
or  a  combination  of  them,  can  be  considered  when  designing  controllers  which 
exhibit  desirable  insensitivity  properties.  This  thesis  will  discuss  two 
methods  of  designing  controllers  for  systems  whose  corresponding  models  have 
uncertain  parameters.  We  will  consider  linear  stochastic  models  of  the 
form: 


xt  =  F(0)xt  +  G(9)ut  +  Kwt 

(la) 

y'  *  H(9)xt  +  Lvt 

(lb) 

>- s-  vy- , 


J  -•  -  -  -r  -■ 


,v' 

*£ 


n. 

' 
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where  0  is  a  constant  vector  of  unknown  parameters,  and  v?t  and  vt  are  white 
noise  processes.  A  time  average  of  a  quadratic  cost  is  to  be  minimized: 


J  =  lim 

X  -►  00 


?/ 
1  0 


KVt  + 


u'W  u  )dt 
tut 


(2) 


Equations  (1)  and  (2)  are  not  well-defined  mathematically  since 

(1)  is  generated  by  white  noise,  which  is  not  a  real  physical  process,  and 

(2)  is  an  integral  of  a  stochastic  process.  In  Chapter  2,  Equation  (1)  is  put 
into  Ito  differential  form,  and  Equation  (2)  is  correspondingly  redefined. 

The  exact  assumptions  on  the  unknown  parameters  are  stated.  Finally,  an  exact 
description  of  the  problem  that  is  solved  in  the  design  of  each  type  of 
controller  is  stated. 

The  first  type  of  controller  that  is  considered  we  call  a  least 
sensitive  controller  since  this  controller  minimizes  the  average  cost  over 
the  entire  parameter  set  based  on  the  assumption  that  all  parameters  are 
equally  likely.  The  second  type  of  controller  that  is  considered  we  call  a 
minimax  controller  since  a  controller  is  sought  that  minimizes  the  worst-case 
cost. 


In  Chapter  3,  we  consider  the  unknown  parameter  to  be  in  a  compact 
set  centered  about  a  nominal  value.  We  also  assume  that  each  value  in  the 
set  is  equally  likely,  so  that  a  uniform  distribution  is  induced  on  the  set. 
Therefore,  we  use  a  combination  of  all  of  the  three  types  of  assumptions  on 
the  parameter  set  discussed  above.  The  objective  in  this  chapter  is  to 
design  a  controller  that  minimizes  the  average  cost  over  the  entire  set. 

The  optimal  controller  for  the  nominal  value  is  a  linear  feedback  of  estimates 
of  the  full  state.  To  facilitate  the  solution  of  this  problem,  we  restrict 


the  class  of  controllers  to  the  class  of  linear  feedback  controllers.  However, 
the  dimension  of  the  controllers  is  not  restricted  to  that  of  the  dimension  of 
the  states  since  full-order  controllers  are  not  always  required  to  achieve 
acceptable  performance.  Ashkenzai  and  Bryson  [1]  have  presented  a  method  for 
solution  of  this  problem  when  discrete  distributions  are  assumed.  It  is  shown 
that  this  method  may  be  extended  to  continuous  distributions.  We  consider  the 
performance  of  this  type  of  controller  for  several  examples,  and  these  examples 
show  that  some  performance  must  indeed  be  sacrificed  to  achieve  lower  parameter 
sensitivity. 

In  Chapter  4,  we  again  assume  that  the  unknown  parameter  is  in  a 
compact  set  centered  around  the  nominal  value.  In  this  problem  we  seek  to 
minimize  the  worst-case  performance.  A  way  to  design  these  controllers  is  to 
use  the  optimal  linear  regulator  for  the  model  that  exhibits  the  worst 
performance.  However,  it  is  not  immediately  obvious  that  these  two  problems 
have  the  same  solution  since  the  model  that  is  least  favorable  for  control  may 
not  be  least  favorable  for  state  estimation.  Looze,  Poor  and  others  [7]  have 
shown  that  these  two  solutions  are  equivalent  for  the  case  where  there  is 
uncertainty  in  the  second-order  statistics  of  the  noise  processes.  An 
attempt  is  made  to  use  a  similar  procedure  for  the  case  now  under  considera¬ 
tion,  that  is,  when  there  is  uncertainty  in  the  system  dynamics.  Chapter  4 
explains  how  an  error  model  was  formed  in  the  case  of  uncertainty  in  the  noise 
statistics,  and  how  this  model  was  used  to  show  the  equivalence  of  the  two 
solutions  in  this  case.  However,  for  the  problem  now  under  consideration, 
this  error  model  cannot  be  formed,  and  thus  the  equivalence  of  the  two 
solutions  is  not  clear.  Indeed,  for  the  scalar  case,  we  find  parameter  sets 


such  that  the  two  solutions  are  not  equivalent.  Therefore,  in  Chapter  4, 
we  find  all  the  parameter  sets  satisfying  certain  convexity  conditions  such 
that  the  two  solutions  are  equivalent. 

The  examples  of  Chapter  3  are  then  considered  in  Chapter  4.  The 
least  sensitive  controllers  of  Chapter  3  are  seen  to  exhibit  superior 
performance  over  a  wider  range  of  parameter  values  in  the  sets  under 
consideration,  but, if  the  range  of  parameter  values  off  the  nominal  value  is 
large  enough,  the  minimax  controllers  of  Chapter  4  have  a  lower  maximum  cost 
than  the  least  sensitive  controllers  of  Chapter  3. 

In  Chapter  5,  a  second-order  single-input  single-output  example  is 
considered.  Then,  some  aspects  of  the  design  and  performance  of  the  two 
types  of  controllers  are  discussed  that  are  not  brought  out  in  the  previous 
chapters.  The  order  of  the  system  allows  us  to  investigate  the  relative 
performance  of  reduced-order  least  sensitive  controllers  with  respect  to  the 
full-order  least  sensitive  controllers  and  the  full-order  minimax  controllers 
This  chapter  shows  how  maximin  controllers  for  a  particular  example  may  be 
designed  on  a  numerical  basis.  Also,  the  equivalence  of  this  solution  to 
the  minimax  solution  may  be  analyzed  numerically. 

Finally,  in  Chapter  6,  the  general  properties  of  the  two  types  of 
controllers  are  discussed.  We  also  discuss  the  advantages  and  disadvantages 
of  each  design,  and  thus  outline  what  factors  are  considered  in  choosing 
between  the  two  designs.  Finally,  we  discuss  what  other  designs  may  be 
considered  to  achieve  a  desirable  performance  of  the  control  system. 


CHAPTER  2 


PROBLEM  STATEMENT 


The  system  equations,  (1),  can  be  made  precise  by  modelling 
dyt 

yfc:  y  =  -j— ■  .  Then  the  system  equations  can  be  put  in  Ito  differential 
form: 


dx^  =  F(0)xtdt  +  G(0)utdt  +  Kdw^ 
dyt  =  H(0)x  dt  +  Ldvt 

'w  \ 

t)  is  a  vector  Wiener  process,  with 


(3a) 

(3b) 

(4a) 

(4b) 


and  Q  and  R  are  positive  definite. 

The  parameter,  0,  is  constant  but  not  known  precisely.  However,  0 
is  assumed  to  be  contained  in  a  nonempty,  closed,  compact  set,  ©.  Also, 
the  pairs,  [F(0),G(0)]  and  [F(0) ,H(0) ] ,  are  assumed  to  be  stabilizable  and 
detectable,  respectively,  for  each  060,  The  problem  is  to  choose  ut  to 
minimize  a  time  average  of  the  quadratic  cost  functional,  (2) .  Equation  (2) 

|  becomes  well-defined: 

1  T 

J(uc,0)  =  E  [lim  —  /  (x'Wxxt  +  u^Wuut)dt] 
w,v  T-*°°  0 

|  where  Wx  is  positive  semidefinite  and  is  positive  definite. 


I 


(5) 


At  time  t,  the  controller  has  access  to  past  output  measurements, 

so  that 

ut  =  y(t,yg  ;  s  <  t)  ,  t  >  o  ,  (6) 

where  y  is  an  admissible  control  strategy,  chosen  from  a  certain  class,  R. 
Every  element  of  II  must  satisfy  the  following  conditions  [3]: 

a)  y  must  be  causal, 

b)  y  must  be  such  that  Equation  (3a)  has  a  unique  solution  that 
is  sample-path  continuous, 

c)  y  must  be  such  that  the  cost  function, (5),  is  well-defined. 

When  the  parameter  vector  is  known,  the  solution  to  this  problem 

is  well-known.  The  controller  is  an  optimal  linear  feedback  of  the  least- 
squares  estimate  of  the  plant  states.  Although  an  optimal  controller  can  be 
calculated  for  every  9^0,  the  correct  controller  cannot  be  applied  without 
prior  knowledge  of  9.  Therefore,  when  9  is  unknown,  this  strategy  is  no 
longer  feasible. 

In  the  next  two  sections,  two  suboptimal  solutions  that  are  less 
sensitive  to  variations  in  the  parameter  vector  are  studied. 

The  first  solution  is  based  on  the  assumption  that  any  9€0  has 
an  equal  chance  of  occurring.  Therefore,  a  uniform  distribution  is  assumed 
over  the  set,0.  Then  u^  is  chosen  to  minimize  the  average  cost  over  ©: 

1  T 

J(u  )  -  E{  E  [  lim  ±  /  (xJW  x  +  u'W  u  )dt]}  .  (7) 

9  w,v  T+»  1  0  X 

As  a  further  simplification,  ut  is  assumed  to  be  linear  with  output  feedback. 
The  problem  is  then  reduced  to  a  constrained  optimization  problem. 


The  second  solution  minimizes  the  worst  case  performance;  that  is 

J  =  min  max  J(u  s0)  .  (8) 

€  n  966  C 

When  a  saddle  point  exists,  then  (8)  has  the  same  solution  as  the  problem, 

J  =  max  min  J(u  ,9)  .  (9) 

see  uten  t 

Solution  (9)  is  equivalent  to  using  the  optimal  linear  regulator  for  the 
0€©  that  exhibits  the  worst  cost. 


CHAPTER  3 


A  LEAST  SENSITIVE  SOLUTION 

In  this  chapter  a  suboptimal  solution  is  sought  whereby  a  prior 
distribution  is  assumed  for  0  over  the  set,  ©.  The  problem  is  solved  here 
for  any  continuous  distribution.  However,  since  the  assumption  that  9  is 
equally  probable  over  the  entire  set  is  made,  the  uniform  distribution  is 
assumed  in  the  examples.  The  problem  then  becomes  one  of  choosing  a  control 
to  minimize  the  average  cost  over  the  entire  parameter  set.  To  accomplish 
this  purpose,  the  procedure  described  by  Ashkenzai  and  Bryson  [1]  is  applied 
to  a  series  of  discrete  distributions  that  converge  in  distribution  to  the 
continuous  distribution.  This  procedure  restricts  the  control  set  to  linear 
controllers  with  output  feedback.  The  assumption  of  a  discrete  distribution, 
together  with  a  linear  control  structure,  reduces  the  problem  to  an  optimization 
problem  with  equality  constraints. 

Two  examples  are  then  considered  for  scalar  plant  and  measurement 
equations.  One  example  has  an  unknown  parameter  in  the  plant  dynamics,  and  the 
other  example  has  an  unknown  parameter  in  the  control  gain.  The  performance  of 
the  two  resulting  solutions  will  be  discussed.  In  particular,  it  is  seen  that 
disturbance  attenuation  must  be  sacrificed  as  more  parameter  insensitivity  is 
desired. 


3.1.  Assumptions  on  Distributions 

As  discussed  before,  the  assumption  of  a  uniform  distribution  is 
desired  since  the  parameter  is  unknown  within  a  given  set.  However,  here  we 


only  assume  that  a  continuous  distribution  is  given.  To  facilitate  the  solution 
process,  a  series  of  discrete  distributions  that  converges  to  the  continuous 
distribution  is  created.  Specifically, 


fc(0),  a  probability  density  function  defined  for  0S© 
f.  9-0.  i=l,...,N 

f  a  <  1  1 

N'  0  otherwise 

and  also , 

Fm(0)  F  (0)  as  N  -*•  -  , 

N  C 

where  FN  and  ?c  are  the  corresponding  probability  distribution  functions  for 

f„  and  f  . 

N  c 

3.2.  Restrictions  on  the  Control  Set 

The  control  set  is  restricted  to  linear  output  feedback  controllers 

of  the  form: 


=  C(p)zt 

(10a) 

=  A(p)zt  +  B(p)yt 

(10b) 

where  p  is  a  vector  of  the  entries  in  A(p) ,  B(p) ,  C(p) . 

As  discussed  in  Ashkenzai  and  Bryson  [1],  if  all  the  entries  of 
A,  B,  C  were  specified  as  parameters,  the  parameter  vector  corresponding  to 
the  minimum  cost  would  not  be  unique,  since  only  the  transfer  functions  from 
y  to  ut  are  of  importance.  Therefore,  the  dimension  of  p  must  be  the  smallest 
possible  for  the  specified  order  of  the  controller  and  the  dimensions  of  ut  and  y 
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To  uniquely  specify  all  the  controller  transfer  functions,  the  following 
dimension  of  p  is  required  [1] : 


DIM(p)  =  [DIM(ut)  +  DIM(yt)]DIM(zt) 


In  order  to  standardize  the  minimal  realization  of  the  control 
parameters  in  A,  B,  C,  the  following  block  diagonal  form  of  the  control  matrices 
is  used  [1] : 


(12a) 


and  the  A^'s  are  2x2  blocks  of  the  form: 


P  Ali  P  A2i 


except  when  DIM(zt)  is  odd,  in  which  case  the  last  A.^  is  a  single  scalar 
parameter. 


B  =  B 


where 


(12b) 


(13a) 


Bi  =  (pBli  pB2i 


C  ~  (C^  ^2  •  •  •  ) 


(13b) 

(14a) 


where  each  C.  has  two  columns: 


v  ■ 
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/ 0 

1  \ 

1  PCli 

PC2i  \ 

* 

(14b) 

vN 

PC3i 

P 

PC4i 

-  q 

\  : 


except  when  DIM(zt)  is  odd,  in  which  case  the  last  has  only  one  column: 


/p1  ' 

Cli 


C2i 

\  :  / 


(14c) 


With  the  assumptions  of  a  discrete  distribution  and  of  a  linear 
block  diagonal  form  of  the  controller,  the  problem  can  be  reformulated  as  an 
optimization  problem  with  equality  constraints. 


3.3.  Problem  Redefinition  and  Solution 


The  system  equations,  (3)  ,  together  with  the  specified  block 
diagonal  form  of  the  controller,  (10),  form  a  new  closed  loop  system: 


C)-s<9'p>  C)dt+N<i,>  C) 


(15) 


with 


(dw 

1 

dv 


=  0 


(16a) 


>dwt\  /dwtv 
\dv  /  \  dv^/ 


Vdt 


(16b) 


where  S(9,p),  N(p)  and  V  are  given  by 


9 


3 


>8 

a 


3 


s(e,p) 


G(e)c(p)' 


B(p)HCe)  A(p) 


c  "  ) 

'0  B(p)L  / 

r  °) 

\  0  R  / 


For  discrete  distributions,  the  cost  functional  becomes 


J(p)  -  £  f  J(6  ,p) 

i-1 


Using  (5) , 


J(0  ,p)  =  E  [  lim  i  /  (xTf  x  +  z;C'(p)W  C(P)ZJdt] 

w,v  T-*<=°  1  0  C 

1  T 

J (®i »p)  *  lim  y  /  ^trfwx  E  (xt  xpi  +  tr[C'(p)WuC(p)  E  (ztzp]}dt 
T  “*• 00  0  w.v  w.v 


Xt(0)  =  E 


Vt  XtZt 


W’V  \  Vt  Vt 


be  the  covariance  matrix  of  and  z^,  and  let 


W(p)  = 


0  CT(p)WuC(p) 


Then, 


1  T 

J(9i,p)  =  lim  -  /  tr[W(p)Xt(0.)]dt 


This  expression  can  be  further  reduced  [3], 

J(0i,p)  -  tr[W(p)X(0i)]  ,  (23) 

where  X(9^)  for  each  0^  is  the  steady  state  covariance  of  xtand  z£  and  is 
the  solution  of  the  algebraic  Lyapunov  equation: 


S(0i,p)X(01)  +  X(01)S"(0i,p)  +  N(p)VN'(p)  -  0  .  (24) 


Then, 


N 

J(p,X)  =  E  tr[f  W(p)X(8 .) ]  .  (25) 

i=l 

Minimization  of  Equation  (25)  by  choice  of  p,  together  with  N  Equations  (24), 
form  an  optimization  problem  with  equality  constraints. 

A  Hamiltonian  function  may  be  formed: 


H(p ,X, A) 


N 

I^{tr[fiW(p)X(01)]  +  tr[A(9i){S(0i,p)X(0.)  +  XO^S'^.p) 

+  N(p)VN'(p)}]}  ,  (26) 


where  A(9^)  are  N  Lagrange  multiplier  matrices. 

Necessary  conditions  for  minimization  of  (26)  by  choice  of  p  are 
N  Equations  (24)  plus  N  equations: 


3H 


3X(8 


—  =  A(9i)S(9i,p)  +  S'(9i,p)A(9.)  +  f.W(p)  =  0 


(27) 


and  the  gradient  equations  for  p: 

3H  ^  ^S(s  ,p)  3S  (9  ,p) 

'  w(trUEi  +  A<Si>  +  ~77~~  A<9i)ix(6i,} 


+  tr{A(9i)  VN'(p)  +  N(p) 


3p 


j 


(28) 


for  j*l, . . . ,DIM(p) . 
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Note  that  (24),  (27),  (28)  form  a  coupled  system,  with  (27)  adjoint 
to  (24).  However,  with  a  choice  of  p.  Equations  (24)  and  (27)  determine 
X(8i)  and  A(0^),  and  given  X(6^)  and  A(8^) ,  Equation  (28)  determines  a 
value  of  p  that  gives  a  stationary  value  of  J  in  Equation  (25) . 

The  existence  of  a  solution  for  continuous  distributions  as  a  limit 
of  the  solutions  of  a  series  of  discrete  distributions  follows  from  the 
definition  of  the  Lebesque  integral  and  the  uniqueness  of  p. 

The  covariance  equations  and  the  adjoint  Lagrange,  multiplier 
equations,  (24)  and  (27),  have  unique  solutions  for  all  0£©,  so  that  J(8,p) 
is  well  defined  by  (23).  Therefore,  since  (24)  and  (27)  are  linear,  it  is 
possible  to  find  expressions  for  X(,8)  and  A ( 8 )  which  are  continuous  in  0. 


Then, 


J(8,p)  -  tr[W(p)X(8) ] 


A  Hamiltonian  function  may  be  formed. 


H(p ,X, A)  =  /  tr[W(p)X(0)]f(8)d8 
0 


+  /  tr[A(e){s(e,p)x(e)  +  x(e)s'(e,P)  +  n(p)vn'(p) }]de 
© 


#■  -  /  tr[~^  X(0)]f(8)d0 


j  e 


+  /  tr{  [A(8) 

©  8p 


5Sj;9>p)  +  -S^9?p)  A(8)  ]X(0)  }d8 
9pj  9pj 


+  /  tr{A(0)[~^-  VN'(p)  +  N(p)V  ~  }d8 

©  3pi  3pi 


for  j*l, . . . ,DIM(p) . 


R;  the  properties  of  the  Lebesque  integral  [5],  (26)  and  (28)  converge  to 
(30)  and  (31)  as  Since  p^  is  unique  for  each  solution  corresponding 

to  a  distribution, f^ (6) , 


p  -*■  p  as  N  -*■  °° 
N  c 


where  p  is  the  solution  for  the  continuous  distribution, 
c 

It  remains  to  find  a  procedure  for  solving  the  system  (24) ,  (25) , 
(27),  (28).  An  iterative  procedure  may  be  used.  At  each  step,  J(p)  and 
its  gradients,  ,  can  be  calculated  given  a  distribution,  fN(0),  and 

a  value  of  p.  Then,  J(p)  and  its  gradients  may  be  used  to  determine  a  new 
value  of  p  for  the  next  step.  These  steps  are  repeated  until  a  value  of  p 
is  determined  that  gives  a  stationary  value  for  J(p).  A  conjugate  gradient 
algorithm  from  the  IMSL  Library  is  used  to  perform  the  iterations .  The 
resulting  routine  is  in  the  Appendix.  See  also  References  [4],  [6]  and  [8] 


3.4.  Examples 


The  following  one-dimensional  system  will  be  considered: 


dx^  =  S^x^dt  +  Q^Uj^dt  +  dw^ 


dyt  a  x^dt  +  dv^ 


dw  \  / dwt 


0  J  E  V  dv.  /  \dv 


1  0 


0  1 


J(ut,0)  =  E  [  lim  y  /  (x^  +  u^)dt ] 
w ,  v  - 


S  V*. 
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In  Example  3-1,  02  will  be  known  and  equal  to  1,  0^  will  be  uniformly 
distributed  with  E0^  =  1.  In  Example  3-2,  0X  =  1  and  02  will  be  uniformly 


distributed  with  E02  =  l. 


The  uniform  distributions  are  of  the  form: 


fj  6)  = 

c 


b  -a 
c  c 


a  <  9  <  b 

c  -  -  c 


otherwise 


(34a) 


a  +b 

E  (0)  -  1  - 
c  2 


(34b) 


A  series  of  discrete  distributions  that  converges  to  f  (0)  is: 

c 


fN(e) 


I  e  =  a,  a  +  .--b.€y  a  ,  2(VaN) 

N  V aN  N-l  »  aN  N-l 


0  otherwise 


and  a^,  (b^-a^)  are  chosen  such  that 


(35a) 


EN(0)  =  Ec(0)  =  1 
VARn(0)  =  VAR  C6) 


(35b) 


For  the  uniform  distribution,  Ufa^jb^]  , 


Ec(9)  •  / c  rV 

a  c  c 
c 

a  +b 

E  (0)  =  — ■  — 
c  2 


2 [1  +  2  +  ...  +  (N-2)]aN(bN-aN) 


N(N-l) 


[l2  +  22  +  ...  +  (N-2)2](bN-aN)2 


N(N-l) 


Let 


S  =  0 
o 


SN  =  SN-1  +  N 


Then  for  N  >  2,  [l2  +  22  +  ...  +  (N-2)2]  =  Also, 


[1  +  2  +  ...  +  (N-2)]  -  (N-2)(N-1) 


Therefore, 

VARjjO)  =  (n  ”  4)  aN  +  (n  “  A")  bN  "  I  aNbN 


.  (N-2)  N-2  ,  \  2 

+  N  +  2  '^N  N' 

N  N  N  N(N-l)  N 


VARn(6)  = 


11  N-2 

N  4  . 2 


N(N-l) 


^N_3N^ 


Then  using  Equations  (35b) 


(bN~aN^ 


(b  -a  ) 
c  c 


^2  (i  '  4  +  — ) 
\N  4  N(N-l)  / 


N 


1  - 


(W 


(3< 


(A( 


(a: 


(a: 


Given  a  continuous  distribution,  (3A) ,  Equations  (35a),  (39),  (Al)  describe 


the  series  of  discrete  distributions  converging  to  f  (9) 


The  block  diagonal  form  for  first  and  second-order  controllers 


20 


1st  order  - 


dzt  =  plztdt  +  p2dyt 
The  closed  loop  system  is 


The  closed  loop  system  is 


Then  Equations  (24),  (27),  (28)  reduce  to: 
(24)  : 

291Xl  +  2}^  +  1  =  0 

P2X1  +  (01  +  P2)X1  +  x3  =  0 
2p2X2  +  2p1X3  +  p2  =  0 


(42) 


(43) 


(44) 


(45) 


'J! 


V* 

c* 

% 

j 

1 

21 

- 

K 

(27)  : 

2Q^1  +  2p2A2  +  f  =  0 

u 

.  ' 

.* 

9 

Aj  +  (0j_  +  P!>a2  +  p2A3 
2A2  +  2P]A3  +  f±  =  0 

=  0 

J 

J 

m 

\ 

(28): 

\ 

f  -  =  2A2X2  +  2A3X3  -  0 

w 

3p2  *  2A2X1  +  2A3X2  +  4A 

3p2  ’  0  ' 

The  gradient  method  discussed  earlier  will  be 

used  to  solve  these  % 

equations 

.  However,  since  the  procedure  is  a  local  one. 

an  adequate  starting 

point  must  be  found.  One  such  point  that  has  proven  useful  is  the  optimal 

■ 

regulator 

solution  for 

This  can  be  found  by 

J 

solving  the  well- 

•A 

known  optimal  regulator  equations 

and  transforming  them 

into  the  block 

diagonal 

coordinates.  The  optimal 

regulator  for  0^  =  1  is  described  by  [3]: 

a 

u  =  -  Px 
t  t 

(46a)  2 

dxt  =  Kx  dt  +  (1  -  P  - 

K)x^dt  +  Kdv^ 

(46b)  :] 

- .; 

2P  -  P2  +  1  =  0 

P  «  1  +  Ji  =  2 .414 

(46c) 

f; 

2K  -  K2  +  1  =  0 

K  =  1  +  /2  =  2.414 

(46d)  * 

But,  in  block  diagonal  coordinates 

5 

0 

;■) 

V 

u  =  z 
t  t 

• 

*<1 

si 

H 

So  the  transformation,  x  =  -  —  z 

t  P  t 

is  used,  and 

' 

dz  t  =  -  PKx^dt  +  (1  - 

P  -  K)z  dt  -  PKdv 
t  t 

.  J 

Comparing  with  Equation  (43) , 

n 

y 

¥ 

.  «  A  ■  -  *-  f  —  '■  .  -  -*i 

« 

•( 

> 

^  -v  "S. 
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dzt  =  P2xtdt:  +  piztdt  +  p2dvt  » 

we  find 

px  =  -  (1  -  P  -  K) 
p2  =  -  PK 

With  P  =  2.414  and  K  =  2.414,  the  starting  point  is  calculated  as, 


Px  =  -  3.828 
P2  =  -  5.828 

Optimal  values  for  three  uniform  distributions  are  calculated.  The 


solutions  for  some  approximating  discrete  distributions  are  also  shown. 


U[0 . 9 ,1 . 1] 

VARG  = 

0.00333 

N 

P1 

p2 

J 

3 

-3.8275 

-5.8915 

16.6942 

7 

-3.8273 

-5.8917 

16.6948 

15 

-3.8273 

-5.8917 

16.6949 

continuous 

-3.8273 

-5.8917 

16.6949 

U[0.7,1.3] 

VAR  = 

0.03000 

N 

pl 

P2 

J 

3 

-3.8505 

-6.3474 

18.2374 

7 

-3.8453 

-6.3577 

18.2717 

15 

-3.8443 

-6.3595 

18.2^76 

continuous 

-3.8441 

-6.3600 

18.2792 

U[0.5 ,1. 5 ] 

VARG  = 

0.08333 

N 

P1 

P2 

J 

3 

-3.9481 

-7.1189 

20.9582 

7 

-3.9333 

-7.1679 

21.1293 

15 

-3.9299 

-7.1767 

21.1605 

continuous 

-3.9289 

-7.1791 

21.1693 

Note  that  when  the  variance  is  small,  convergence  to  the  continuous 


solution  is  very  rapid,  as  can  be  seen  especially  in  the  U[0.9,l.l]  distribu¬ 
tion,  where  four  significant  figures  are  achieved  at  N=3.  However,  the 
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convergence  rate  decreases  as  the  variance  increases ,  so  that  for  the 
U[0.7,1.3]  distribution  four  significant  figures  are  not  achieved  until 
N=15,  while  for  the  U[0.5,1.5]  distribution  even  more  exact  distributions 
are  required  for  the  same  precision. 

Once  optimal  parameters  for  a  particular  distribution  are  found, 
it  is  interesting  to  see  how  the  actual  cost  varies  as  a  function  of  0.  It 
is  desired  that  the  cost  incurred  for  a  particular  0  be  as  close  as  possible 
to  the  cost  incurred  if  the  optimal  regulator  was  used  for  that  value  of  0 . 
Therefore,  defining  JLg(0)  as  the  cost  incurred  by  the  least  sensitive 
controller,  and  dQpT(0)  as  the  cost  incurred  by  an  optimal  regulator  designed 
for  0,  a  performance  measure  that  may  be  used  is  a  plot  of  JLg(0)  “  ^qPT^ 
versus  0.  Some  interesting  properties  of  these  systems  can  be  discovered. 

Figures  1  to  3  show  JLg(®)  ~  ^OPT^  f°r  the  t*iree  distributions 
considered.  From  these  curves,  several  properties  of  the  least  sensitive 
solutions  can  be  discerned. 

1)  As  can  be  expected,  the  relative  cost  increases  as  the  true 
value  of  9  deviates  from  the  mean.  This  is  a  desirable  property  since  less 
performance  is  sacrificed  as  the  true  value  of  0  moves  toward  the  nominal 
value . 

2)  JQpT(0)  is  not  achieved  at  any  0.  If  JopT(0)  was  achieved  at 
a  value  of  9,  it  would  be  an  optimal  regulator  for  that  9.  Therefore,  the 
least  sensitive  solutions  are  not  optimal  for  any  0.  Also,  this  fact  shows 
that  disturbance  attenuation  is  always  sacrificed  to  achieve  less  sensitivity. 

3)  More  disturbance  attenuation  is  sacrificed  as  less  sensitivity 
is  desired.  This  property  may  be  made  clearer  by  investigating  the  perfor¬ 
mance  at  the  mean: 


■1  »>■  »■>-  k2 


1 

.1 

■J 

a 


J.JIIP1.  fi "jwtttt*:  .  .  i  w.  j*sw."  i  r  t  <1,  > :  \ 1  ■ :  ■ .  ■  j  >.,rjnw 


Distribution 


VAR0 


JLS(1-0)  *  JOPT(1‘0) 


U[0.9,l.l]  0.00333  0.03 
U[0.7,1.3]  0.03000  0.31 
U[0.5 ,1.5]  0.08333  1.39 


As  can  be  seen,  performance  at  the  nominal  value  of  9  deteriorates  as  the 
variance  of  9  increases. 

4)  The  least  sensitive  controller  is  biased  in  the  sense  that 
the  minimum  value  of  the  relative  cost  occurs  at  a  value  of  0  that  is  higher 
than  the  nominal  value.  This  bias  may  be  quantified  by  calculating  the 
value  of  0  at  the  minimum  as  a  percentage  of  the  full  deviation  possible. 


Distribution 


Bias 


Full  Deviation 


U[0.9,l.l] 
U[0.7 ,1.3] 
U[0.5,1.5] 


15%  above 
33%  above 
45%  above 


0.1 

0.3 

0.5 


Figure  4  shows  -IqpjC0)  as  a  function  of  9 .  As  can  be  seen  from  this  figure, 
the  controller  is  biased  toward  9's  that  are  associated  with  higher  costs. 
Also,  the  percent  bias  increases  because  Jqp^C9)  increases  at  an  increasing 


rate . 


Second-order  controllers  may  be  calculated  for  this  example  using 
the  same  algorithm.  To  calculate  a  starting  point,  a  comparison  is  made 
between  the  transfer  functions  of  the  first  and  second-order  controllers: 


first  order: 


U(s)  =  P2 
Y  (s)  s  -  p1 


second  order : 


U(s) 

Y(s) 


P3P1 

p4(s  +  -?r 


) 


(s  -  PL) 


S  +  p,  - 


P2  + 


P1  -  plp2  +  P1 
s  +  P1  "  P2 


Using  the  starting  point  for  the  first-order  controllers  a  starting  point 
for  the  second-order  controller  can  be  formed  by  matching  the  two  transfer 
functions.  Let 


p^  (second  order)  =  p^  (first  order) 

P4  (second  order)  =  p2  (first  order) 

Then,  p2  and  p^  can  be  chosen  so  that  the  second  pole  is  stable,  and  nearly 
cancels  the  zero.  Therefore,  using  the  starting  point  for  first-order 
controllers : 

p  =  -  3.828 
P2  =  -  5.828 

set 

P;L  =  -  3.828 
P4  =  -  5.828  . 

Let  Px  -  P2  =  1, 


then  p2  =  P^  -  1  *  -  A. 828  . 

Then,  so  that  s  +  p^  -  p2  cancels  the  zero, 


set 


1.522 


1 


Therefore,  the  starting  point  for  the  second-order  controllers  is: 


Second-order  controllers  were  calculated  for  the  same  distributions. 


The  results  are  summarized  below. 


U[0.9,l.l] 

P1 

P2 

P3 

P4 

J 

1st  order 

-3.8273 

-5.8917 

16.6949 

2nd  order 

-4.4948 

-5.0194 

1.5411 

-5.9062 

16.6948 

U[0 . 7 ,1 . 3] 

pl 

P2 

P3 

P4 

J 

1st  order 

-3.8441 

-6.3600 

18.2776 

2nd  order 

-5.5380 

-5.3948 

1.6626 

-6.4557 

18.2722 

U[0.5,1.5] 

P1 

P2 

P3 

P4 

J 

1st  order 

-3.9289 

-7.1791 

21.1693 

2nd  order 

-7.4618 

-6.0202 

1.8371 

-7.3618 

21.1397 

The 

percent  improvement  in  J  may 

be  calculated: 

Distribution 


%  AJ 


U[0. 9,1-1]  0.0006% 
U[0.7 ,1.3]  0.0295% 
U[0.5,1.5]  0.1398% 


These  figures  show  that  the  improvement  found  using  the  second-order  control¬ 
lers  is  very  low,  but  the  improvement  increases  as  the  variance  of  6  increases 
Figures  5,  6  and  7  show  a  plot  of  J^g  (first  order)  -  JLS  (second  order)  as 
a  function  of  8  for  the  three  distributions.  These  curves  show  that  this 
second-order  controller  does  not  make  improvement  over  the  entire  range  of  8. 
However,  it  can  be  seen  that  the  improvement  occurs  for  0's  that  are 
associated  with  higher  costs. 

In  Example  3-2,  ^  =  1  and  considered  to  be  unknown  with 

E02  =  l.  Using  the  same  method  as  in  Example  3-1,  with  the  same  starting  points 


optimal  values  for  first  and  second-order  controllers  are  calculated. 


Figure  6.  J,  „(lst  order)  -  J_(2nd  order)  vs.0 


U[0.9,l.l] 


VAR0  =  0.00333 


first  order 
second  order 


Pi  P2 


p3 


J 


-3.8031  -5.8707 

-3.4583  -4.7587 


16.7323 

1.5493  -5.9083  16.7314 


U[0. 7,1-3]  VAR0  =  0.03000 


first  order 
second  order 


Pi  P2 


p3 


J 


-3.6087  -6.2470 

-4.1535  -5.1693 


19.0127 

1.7693  -6.5897  18.9244 


Ut0.5,1.5]  VAR0  =  0.08333 


first  order 
second  order 


P1  P2  P3 


J 


-3.2658  -7.2651 

-5.6600  -6.1005 


25.8909 

2.2744  -8.2686  24.9534 


Figures  8,  9  and  10  show  plots  of  JLS(0)  “  ^qPT^  versus  0  for  the  first- 
order  controllers.  Basically,  the  conclusions  here  are  the  same  as  in 
Example  3.1,  and  only  a  few  differences  will  be  noted  here. 

1)  The  relative  cost  for  the  parameter  in  the  control  gain  is, 
in  general,  higher  than  the  relative  cost  for  the  parameter  in  the  plant 
dynamics . 


2)  These  controllers  are  biased  below  the  mean.  To  quantify: 


Distribution 


Bias 


Full  Deviation 


U[0.9,l.l] 
Ut0.7,1.3] 
U[0.5 ,1.5] 


15%  below 
42%  below 
b6%  below 


0.1 

0.3 

0.5 


Notice  also  that  these  biases  are  greater  than  in  Example  3.1. 

Figure  11  shows  that  is  greatest  for  parameter  values  less 

than  the  mean,  which  shows  why  the  biases  are  below  the  mean.  Also  note  that 


JopT(e)  is  generally  greater  than  in  Example  3.1,  and  increases  at  a  faster 
rate  than  in  Example  3.1,  thereby  explaining  why  the  relative  costs  and  biases 
are  greater  than  those  in  Example  3.1. 

For  second-order  controllers,  the  degree  of  improvement  is  also 
greater  than  that  of  Example  3.1. 


Distribution 

%AJls  (Ex.  1) 

%AJls  (Ex.  2) 

UtO.9,1.1] 

0.0006% 

0.0053% 

U[0.7,1.3] 

0.0295% 

0.4644% 

U  [  0 . 5 , 1 . 5  ] 

0.1398% 

3.621% 

Note  that  in  this  case,  the  improvement  can  become  significant  although  still 
not  very  great. 

Figures  12,  13  and  14  show  J^g  (first  order)  -  JLS  (second  order) 
for  this  example.  As  seen  before,  a  savings  is  not  made  over  the  entire 
parameter  range,  but  the  improvement  is  made  for  0's  associated  with  higher  • 
costs . 


3.5.  Conclusions 

To  conclude,  it  has  been  shown  how  the  method  of  Ashkenzai  and 
Bryson  may  be  used  to  find  least  sensitive  controllers  for  systems  with 
parameters  described  by  continuous  distributions.  The  rate  of  convergence 
of  the  solution  is  very  rapid  although  it  slows  down  as  less  parameter 
sensitivity  is  desired. 

The  performance  of  the  least  sensitive  controllers  was  discussed. 


It  was  concluded  that  the  performance  of  the  system  decreased  as  the  distance 


» 
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of  the  real  parameter  from  the  nominal  value  increased .  Disturbance 
attenuation  was  seen  to  be  sacrificed  as  less  parameter  sensitivity  was 
desired.  Also,  the  controllers  were  biased  towards  those  values  of  0 
associated  with  higher  costs.  Finally,  it  was  seen  that  second-order 
controllers  did  not,  in  general,  demonstrate  much  improvement  over  the 
first-order  controllers. 
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CHAPTER  4 

A  MINIMAX  SOLUTION 

In  this  section,  we  seek  to  find  the  optimal  controller  for  the 
value  of  the  unknown  parameter  that  corresponds  to  the  worst-case  model. 

This  problem  is  characterized  by  (8)  : 

J(u  ,9  )  =  min  max  J(u  ,0)  , 

0  u  SII  066  C 

where  J(ut>9)  is  given  by  (5).  A  way  to  calculate  the  solution  is  to  find 
the  optimal  regulator  for  all  possible  models  and  then  to  maximize  the 
corresponding  cost  over  che  parameter  set.  This  problem  is  characterized  by 
(9)  : 

J (u  ,  9  )  *  max  min  J(u  ,6) 

°  °  9€©  uen  c 

For  the  solutions  of  these  two  problems  to  be  equivalent,  a 
saddle  point  must  exist  at  the  solution  point.  Looze,  Poor,  et  al.  [7]  have 
shown  that  a  saddle  point  exists  when  the  uncertainties  are  in  the  second- 
order  statistics  of  the  noise.  However,  some  additional  convexity  assumptions 
are  needed  for  the  parameter  set,  ©,  in  this  proof.  We  have  made  an  attempt 
to  use  the  same  method  to  show  the  existence  of  a  saddle  point  under  the 
current  assumptions.  However,  this  cannot  be  done  since,  as  seen  later  in 
the  case  of  scalar  systems,  the  saddle  point  does  not  exist  for  all  types  of 
parameter  sets  under  consideration.  Therefore,  we  state  what  conditions  must 
be  satisfied  in  Theorem  2,  and  then  for  scalar  systems,  we  find  the  largest 
sets  such  that  the  saddle-point  condition  holds. 
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The  examples  from  the  previous  section  are  then  considered.  The 
minimax  solutions  for  these  examples  are  calculated ,  and  using  the  results 
for  scalar  systems,  the  minimax  solution  is  shown  to  enjoy  the  saddle-point 
property  for  the  sets  under  consideration.  The  performance  of  the  minimax 
controller  is  compared  to  the  performance  of  the  least  sensitive  controller. 

It  is  generally  seen  that  the  least  sensitive  controllers  exhibit  superior 
performance  over  a  wide  range  of  parameter  values  in  the  set. 

Finally,  we  include  a  note  on  the  notation  used  here.  A  subscript 
"9"  denotes  that  the  matrix  in  question  is  a  function  of  the  unknown  parameter 
vector.  A  subscript  of  "o",  or  no  subscript,  indicates  that  the  matrix  is 
evaluated  at  the  maximin  solution. 


4.1.  The  Maximin  Solution 

To  solve  (9) ,  the  optimal  linear  regulator  is  determined  for  all 
S€0,  and  the  resulting  cost  is  maximized  over  9  €  ©.  The  optimal  linear 
regulator  is  [3]  : 


“e  ■  -  'CsVt 


-  <Fe  -  +  I.H'(LRL-)-1(dyt  -  H^dt) 


pe'ps  +  V9  -  VeKlVe  +  “x  '  0 


Ve  +  Ve  -  £9He(LEL')'liVe  +  KQK'  ’  0 


(47a) 

(47b) 

(47c) 

(47d) 


The  optimal  cost  is 

J(u0,9)  =  tr[P0KQK'  +  P^w'V'P  .  Z0  ] 


(48) 


Maximizing  (48)  over  0€©  with  PQ  and  Eg  given  by  (47c)  and  (47d)  gives  the 
maximin  point,  (uq,9o).  The  cost  at  this  point  is 


J(u  ,0  )  =  trfP  KQK'  +  P  G'W  1G"P  E  ] 

O  O  O  O  O  U  O  o  o 


(49) 


In  order  to  investigate  the  existence  of  a  saddle  point  at  this 
solution,  an  expression  for  the  cost  of  the  maximin  control  at  other  values 
of  0  is  needed.  By  applying  the  control,  (47)  evaluated  at  9  ,  to  the  system, 
(3),  the  following  closed-loop  model  is  formed.  Let 


S  = 


‘EH  "  (LRL ')_1H. 


-G  W_1G 'P 
0  u 

F  -  GW-1G'P 


-  EH' (LRL-1) H  ^ 


(50a) 


N  - 


'K  0 

•  0  -  EH  “  (LRL ')_1L 


(50b) 


W  = 


W 


j_1r:'p  ^ 


(50c) 


0  PGW  G"P ■ 

u 


Then  the  closed-loop  model  is 


(I1)  ■  *(■')••  *  C) 


(51) 


The  corresponding  cost  is  (under  the  assumption  that  S  is  a  stable  matrix 
for  all  0£©  —  which  will  be  justified  later) 


J (u  ,9)  =  tr (W  X) 
o 


(52) 


where  X  is  the  positive  definite  solution  to 


SX  +  XS'  +  NVN '  =  0 


(53) 


Then  Equation  (52)  reduces  to 


J(u  ,0)  =  tr[W  X,  +  PGW_1G-PX,]  .  (54) 

o  L  x  1  u  3 

Also,  (53)  leads  to  three  equations: 

F-X.  +  X,Fr  -  GqW_1G^PX"  -  X0PGW_1g;  +  KQK'  =  0  (55a) 

9119  0u2  2u9 

FQX„  -  GaW_1G^PX_  +  X. H ' (LRL ') _1HZ  +  X«(F  -  GW_1G'P  -  EH'(LRL')_1H) " =  0  (55b) 
026u  319  2  u 

(F  -  GW-1G'P  -  EH ' (LRL ") _1H) X_  +  X,(F  -  GW-1G 'P  -  EH ' (LRL ')_1H) " 
u  3  3  u 

~1  _i  _1  (55c) 

+  EH ' (LRL  ')  HE  +  EH ' (LRL')  H0X2  +  X'H0(LRL')  HE  »  0 

Equations  (54)  and  (55)  represent  the  cost  incurred  by  the  maximin  controller 
for  models  that  correspond  to  all  the  different  parameter  values  in  the  set. 


4.2.  The  Minimax  Solution  -  Existence  of  a  Saddle  Point 

The  solution  to  the  minimax  problem,  (8),  exists,  and  is  equal 
to  (9)  if  a  saddle  point  e'  sts  at  the  solution,  J(u  ,9o).  The  saddle  point 
is  characterized  in  the  following  theorem. 

Theorem  1  [  2  ]  :  There  exists  a  pair  (uq,0  )€II  *  0  satisfying  the  saddle-point 
condition  : 

J(u  ,9)  <  J(u  ,9  )  <  J(u  ,9  )  Vu  6  n  ,  ,  (56) 

o  -  oo  -  to  t 

if  and  only  if  the  values  of  (8)  and  (9)  are  equal. 


It  is  necessary  to  show  that  condition  (56)  holds.  The  right-hand 

side  follows  immediately  from  the  fact  that  uq  is  the  optimal  linear  regulator 

for  0  .  Looze,  Poor,  et  al.  [7]  prove  the  left-hand  side  for  their  case  by 

establishing  the  existence  of  the  maximin  solution  to  (9) ,  and  then  showing 

that  the  condition  that  the  Frdchet  differential  of  J(u  ,9)  at  (u  ,6  )  is 

t  o  o 

nonpositive  for  all  9  €  ©  is  equivalent  to  the  left-hand  side  of  (56). 

To  follow  similar  logic  in  the  present  case,  a  further  restriction 

must  be  made  on  the  parameter  set.  Specifically,  we  assume  that  9  may  be 

split  into  three  vectors,  9_,  0_  and  0„,  where  the  following  conditions  hold: 

r  b  H 

Fe  -  F(9p)  eFeeF 

S  ■  G(V  9GeeG  (57) 

“a  '  H<V  9hssh 

and  ©_,  ©„  and  0  are  disjoint,  convex  sets,  and  0„x  ©  x  ©„  =  ©. 

r  O  H  r  O  tl 

Since  the  pairs  (Fg,Gg)  and  (Fg,Hg)  are  stabilizable  and  detectable, 

respectively,  and  since  u  is  an  optimal  regulator,  J(u  ,0)  is  bounded  and 

0  0 

continuous  in  9.  Therefore,  since  ©  is  compact,  the  maximin  solution  to  (9) 
exists . 

We  now  make  the  dependence  of  J(uq,9)  more  explicit: 

J(uq,9)  =  J(uo,Fe,GQ,H9)  ,  (58) 

and  we  will  refer  to  (F  ,G  ,H  )  as  the  point  corresponding  to  (9).  Let 

9  0  0 

(u  ,9  )  =  (u  ,F,G,H)  be  the  maximin  solution.  Then  the  Fr^chet  differential 
o  o  o 

of  (48)  with  respect  to  F  ,G  ,H  at  (F,G,H)  must  be  nonpositive  in  every 

9  0  9 

direction  into  ©.  The  Frdchet  differential  of  (48)  is  now  calculated  with 
respect  to  each  of  the  parameter  vectors. 


we  start  with  Equation  (48) , 


J(VFe>Ge’V  =  trfp,K^'  4  paGX  W 


u  8  9  8- 


The  differential  of  J(u  ,Fa,G.,H.)  with  respect  to  F  at  (F,G,H)  is 

o  y  y  y  y 

6 J (u  , F,G ,H  ;  AF)  =  tr[5P  KQK'  +  5PGW_1G'PE  +  PGW_1G' APE +  PGW-1G'P5E 1 
o  u  u  u 


where  6P  and  5E  are  the  Frechet  differentials  of  PQ  and  EQ  at  (F,G,H),  and 

y  y 


can  be  calculated  using  (47c)  and  (47d) . 
For  6P  we  use  (47c)  : 


F'P  +  P  F  -  P  GW_1G"P  +  W  =  0 


AF 'P  +  F'SP  +  SPF  +  PAF  -  5PGW~1G'P  -  PGW  1G^5P 

u  u 

(F  -  GW_1G"P) '5P  +  6P(F  -  GW_1G'P)  +  AF'P  +  PAF 
u  u 


Let  A  = 


F  -  GW  G'P,  Then, 
u 


5P  *  /  e  (AF'P  +  PAF)e  dt 
0 


For  i E  we  use  (47d) 


FI  t  IF'  -  EH '(LRL')  HE  +  KQK'  =  0 


AF?:  4  F  M  +  5EF'  +  AFE  -  oEH'(LRL') 


'hi E  -  E 


H'(LRL')_1H6E  =  0 


(F  •  EH' (I.RL')'1H)6E  +  5E(F  -  EH '  (LRL ')_1H)  '  +  AFE  +  EAF'  =  0 


Let  B  -  F  -  ZH '  (LRL ') 


Then , 


eBt(AFZ  +  EAF')eB  Cdt 


Substituting  (61)  and  (60)  into  (59) , 


6J(uq,F,G,H  ;  AF)  =  tr{6P[KQK"  4  GW^G'PE  +  EPGW^G']  4  [PGW^G'PjfiE} 

=  tr{ /  eA  t[AF"P  +  PAF]eAtdt [KQK'  +  GW^G'PE  4  EPGW^G'] 

4  [PGW-1G'P]7  eBt[AFE  4  EAF"]eB  Cdt) 

U  0 

=  tr{[AF'P  4  PAF]  /  eAt  [KQK"  4  GW^G'PE  4  EPGW~1G"]eA  fcdt 

GO  ~  ^ 

4  [AFE  4  EAF']/  e®  t [PGW~1G"P]eBtdt } 

0  U 

6J(uq,F,G,H  ;  AF)  =  2tr{ [M  -P  4  EM^AF}  (62) 

(F  -  GW~1G^P)Mf1  4  MF1(F  -  GW~^G'P)  "  4  KQK"  4  GW^G'PE  4  EPGW^G'  -  0  (63a) 

« 

(F  -  EH'(LRL')-1H)  "  4  M^CF  -  EH"(LRL")_:LH)  4  PGVfVp  =  0  .  (63b) 

Equation  (62)  with  Equations  (63)  describes  the  Frechet  differential  of 
J(uo,Fq,G0,H0)  with  respect  to  FQ  at  (F,G,H). 

In  order  to  find  the  differential  with  respect  to  G  at  (F,G,H)  we 

v 

again  start  with  (48) : 

J<WVH9>  *  trtPeKQK'  +  P^W^G'P^]  . 

Then,  using  (47c)  and  (47d) , 

J(VWV  ■  "‘Vo  +  PsE9H9<LB1')'1,19£9I  • 

The  differential  of  J(u  ,F.,G.,HQ)  with  respect  to  G.  at  (F,G,H)  is 

o  t)  u  w 

6J(uq,F,G,H  ;  AG)  =  tr{6P[EH'(LRL')_1HE] }  (64) 

where  6P  is  now  the  Frdchet  differential  of  Pa  with  respect  to  G^  at  (F,G,H), 
and  can  be  calculated  using  (47c)  : 
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£ 


F'6P  +  6PF  -  6PGW_1G 'P  -  PAGW_1G'P  -  PGW^AG'P  -  PGW_1G'6P  =  0 
u  u  u  u 


(F  -  GW_1G'P)'6P  +  6P(F  -  GW_1G'P)  -  PAGW_1G'P  -  PGW~1AG'P  =  0 
u  u  u  u 


SP  =  /  e  [-PAGW  G'P  -  PGW-xAG'P]e  dt 
0 


(65) 


Substituting  (65)  into  (64) 

00 

5j(u  ,F,G,H  ;  AG)  =  tr{/  pA'tf  -1  _  pGW~l 


e"  ’’[-PAGH  *G'P  -  PGWu"AG'P]eAtdt[EH'(LRL')“1HZ]} 


,-l. 


,-l, 


=  trUPAGW^G'P  +  PGW^AG'P]/  eAt  [-EH' (LRL ')_1HE  ]eA  Cdt} 


.-1, 


6J(uo,F,G,H  ;  AG)  =  2tr{ [W  G'PMgPjAG} 


(66) 


(F  -  GW_1G 'P)M  +  H  (F  -  GW_1G 'P)  '  -  EH'(LRL')_1HE  =  0  .  (67) 

U  G  Cj  u 


Equation  (66)  with  (67) describes  the  Frechet  differential  of  J(u  ,F  ,GQ,H.) 

O  U  0  9 


with  respect  to  GQ  at  (F,G,H). 

U 


In  order  to  find  the  differential  of  J(u  ,F  ,G  ,HQ)  with  respect 

o  o  y  y 


to  H  at  (F,G,H),  we  again  start  with  (48).  Then, 


5J(uo,F,G,H  ;  AH)  =  tr{  [PGW^G'P]  6E  } 


(68) 


where  5E  is  now  the  Frdchet  differential  of  E  with  respect  to  HQ  at  (F,G,H), 


and  can  be  calculated  using  (47d)  : 


F6E  +  5EF'  -  6EH'(LRL')-1HE  -  EAH'(LRL')-1HE  -  EH ' (LRL ')'1AHE 


-  EH' (LRL')  LH6E  =  0 


(F  - 


EH'(LRL')_1H)5E  +  6E(F  -  EH ' (LRL ') _1H)  -  EAH ' (LRL  ') _1HE 


-  EH '(LRL')  JAHE  =  0 


5E  =■  /  eBt[-EAH'(LRL')"1HE  -  EH'(LRL ')_1AHE  ]eB  Cdt 
0 


(69) 


a; 

.v. 


,‘Y 


A  <* 


'I 
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Substituting  (69)  into  (68) , 

00  A 

6J(u  ,F,G,H  ;  AH)  *  tr{ [PGW-1G'P]/  eBt[-IAH'(LRL")-1HZ  -  EH'(LRL')-1AHZ] 

°  U  0  - 

•  eB  Cdt} 

=  tr{ [£AH^(LRL')-1HE  +  EH'(LRL>)-1AHE]/  eB  C [-PGW_1G'P]eBtdt} 

0  U 

6J(uq,F,G,H  ;  AH)  =  2tr{[EMHEH'(LRL')~1]AH}  (70) 

(F  -  ZH'(LKL  )“1H)"Mh  +  M^F  -  EH'(LRL')_1H)  -  PGW^G'P  =  0  .  (71) 

Equation  (70)  with  (71)  describes  the  Frechet  differential  of  J(u  ,F.,G.,H_) 

O  “  U  U 

with  respect  to  H.  at  (F,G,H).  This  completes  the  calculation  of  the  Frechet 

y 

differentials . 

Consider  an  arbitrary  point,  (FQ,G.,HQ),  in  0.  Since  ©_  and  ©„ 

y  y  y  run 

are  convex,  the  line  segment  joining  (F,G,H)  and  (F  ,G  ,H  )  is  in  0  and 

y  y  y 


(AF ,AG,AH)  =  (FQ  -  F,  GQ  -  G,H0  -  H)  (72) 

is  a  direction  into  0.  Since  (F,G,H)  is  the  maximin  solution,  differentials 
(62),  (66)  and  (70)  are  nonpositive.  If  by  setting  differentials  (62),  (66) 
and  (70)  nonpositive  we  can  show  that  the  left-hand  side  of  (56)  is  true, 
then  a  saddle  point  exists.  This  can  be  summarized  in  the  following  theorem. 
Theorem  2 :  Let  the  Frechet  differentials  (62)  ,  (66)  ,  (70)  corresponding  to 
the  maximin  solution  be  nonpositive: 


tr ( [Mpi?  +  ZMF2](F6  “  F>>  <  0 

vf9<5 

(73a) 

tr{[W^1G'PMGP](G0  -  G)}  <  0 

vceE  ®g 

(73b) 

tr{[ZMHZH'(LRL')‘1](H0  -  H) }  <  0 

neeH 

(73c) 

where  M^,  M^,  M^,  ^  are  given  by  (63a),  (63b)  (67),  (71). 


Consider  the  left-hand  side  of  (56)  in  view  of  (54) . 

J(uo,F0,G0,H0)  <  J(uq,F,G,H)  (74a) 

«i<y19  +  p™ulG'pi381  :  "tVio  +  <«b) 

where  X,  ,  and  XOQ  are  given  by  Equations  (55),  and  X.  n  and  X__.  are  given  by 
id  jy  jlu  jU 

Equations  (55),  evaluated  at  8  . 

If  (73)  implies  (74),  then  a  saddle  point  exists,  and  the  solutions 
of  (8)  and  (9)  are  equal. 

We  have  shown  that  (73)  is  true  since  the  maximin  solution  exists. 
Also,  the  right-hand  side  of  (56)  is  known  to  be  true.  Therefore,  if  we  can 
show  that  the  left-hand  side  of  (56)  is  true  using  (73),  Theorem  1  can  be 
used  to  establish  the  existence  of  a  saddle  point  and  the  equivalence  of  the 
two  solutions. 

Looze,  Poor,  et  al.  [7]  take  advantage  of  the  fact  that  the  system 
parameters  are  known,  and  by  use  of  the  transformation,  e  *  x  -  x  ,  an  error 
model  can  be  formed.  Using  this  error  model,  an  expression  for  J(uq,9)  can 
be  found  such  that  the  corresponding  equations  to  (73)  and  (74)  are  identical. 
However,  under  the  present  circumstances,  this  comparison  is  not  possible. 

This  is  true  because  uncertainty  in  the  system  parameters  makes  the  formation 
of  an  error  model  impossible.  Therefore,  Equation  (74b)  is  dissimilar  to 
Equations  (73),  and  it  is  not  immediately  apparent  that  (73)  implies  (74). 
Indeed,  as  we  see  in  the  case  of  scalar  systems,  (73)  does  not  imply  (74)  for 
all  types  of  sets  in  the  class  under  consideration.  We  now  undertake  the 
study  of  this  problem  for  scalar  systems,  and  characterize  all  the  sets  in 
the  class  under  consideration  for  which  the  saddle  point  exists. 


4.3.  Scalar  Systems 

The  existence  of  saddle-point  solutions  for  scalar  systems  will  be 
studied  for  a  large  class  of  convex  sets.  The  simplicity  of  scalar  systems 
allows  for  a  more  thorough  investigation  of  the  properties  in  Theorem  2,  and 
therefore,  we  gain  better  insight  into  the  structural  properties  of  the 
problem  at  hand.  First,  we  will  show  how  Equations  (73)  may  be  used  to  char 
acterize  the  maximum  solution  for  the  class  of  convex  sets  that  satisfy  the 
stabilizability  and  detectability  assumptions.  Then  we  will  show  that 
Equation  (74)  can  be  verified  only  for  certain  sets  within  this  class. 

First,  the  class  of  convex  sets  will  be  described.  For  scalar 
systems,  the  parameter  vector  (F. ,G. ,H„)  can  have  at  most  three  scalar 

Q  U  U 

parameters  that  we  represent  by  (f.,g. ,h_).  Convex  sets  that  satisfy  the 

U  C7  0 

conditions,  (57),  consist  of  three  sets  of  intervals  on  the  real  line.  To 
satisfy  the  stabilizability  and  detectability  assumptions  we  require  that 
these  intervals  do  not  contain  the  point  zero.  To  allow  for  the  possibility 
that  one  or  two  parameters  are  known,  we  allow  one  or  two  of  these  intervals 
to  be  reduced  to  a  single  point.  Therefore,  the  class  of  sets  we  consider 
is  @  such  that: 


X  ={©:©=  0p  U  ©G  U  ©R}  ,  (75) 

and  ©  ,  ©  ,  ©  are  each  one  of  the  following: 
r  O  H 


0F  "  {f9€  [fa’V 

fa  <  fb  <  0} 

(76a 

or 

0F  *  {f9S  (fa’fb] 

0  <  fa  <  V 

(76b 

or 

«  (f«  -  f 

fen,  f#o} 

(76c 

G  =  {80€tV8b]  8a  <  *b  <  01  (7' 

or  eG  =  {gee[ga»gb]  0  <  8a  <  8b1  <7' 

or  ©G  =  {g0=  8  8  6  ®  *  g^Ol  (7/ 

®H  -  ha  ‘  \  '  01  (7S 

or  0H  '  ‘"e^W  0  ‘  ha  '  V  (7£ 

or  e  -  Ih  «  h  h  e  K  ,  h  i*  0)  (78 

ri  y 


and  we  need  not  consider  the  trivial  case  ©  =  {(f,g,h)}.  Note  that  the 
assumed  independence  of  f-.g.  and  hfl  allows  us  to  maximize  J(u  ,fQ,gn,h.J 

O  0  0  O  0  0  0 

with  respect  to  f  ,gn,  and  h„  one  at  a  time. 

y  y  y 

The  maximization  of  (48)  over  each  set  in  the  class,  X.can  be 

performed  by  using  its  Fre'chet  differentials  in  the  Condition  (73)  of 

Theorem  2.  We  will  consider  each  condition  in  (73)  one  at  a  time.  First, 

consider  Condition  (73a)  for  f  £@  and  with  ©_  as  in  (76a)  or  (76b)  and 

y  1?  r 

noting  that  Case  (76c)  is  trivial: 


[mFip  +  ZmF2](f0  -  f)  <  0  Vf0€©F 
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Equations  (80)  and  (81)  can  be  solved  bv  direct  division: 
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2  2 
8  P 


2(f  -  —■) 
l  r 


We  know  from  the  properties  of  optimal  linear  regulators  that  (f  -  ■“ — )  and 
o  w 

h2E  u 

(f - x— )  are  stable,  which  in  this  case  means  they  are  strictly  negative. 

£  r 

Therefore,  m^  and  m^  are  positive,  and  m^P  +  mp2^  Pos;i-tive*  Then, 
condition  (79)  is  equivalent  to 


^9  -  f  vf9eeF 


The  only  such  point,  f,  in  sets  in  (76a)  and  (76b)  is  at  the  upper 
boundary.  That  is, 


f  -  fb  f°r  £6£t£a.fbl  fa<fb<° 


£  -  £b  for  f9e  [fa,fb]  0  <  fa  <  £„ 


Now  consider  Condition  (73b)  for  gQS©„  and  with  9_  in  (77a)  and 

y  (j  G 

(77b)  : 
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(85)  can  be  solved  by  direct  division: 


2(f  -  -^2-) 
w 

u 
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Since  (f  -  •&-£-)  <0  ,  m_  <  0.  Now  consider  ©_  in  (77a)  t  In  this  case 

Wu  G 

g  <  0>  and  therefore  “ —  m  >  0.  Then  Condition  (84)  is  equivalent  to; 

W  vj 

u 


<  8  V8«e0i 


6  G 


Therefore,  g  =  g  .  With  ©  in  (77b),  g  >  0,  and  therefore  ■  m_  <  0. 

D  0  W  G 

U 

Then  Condition  (84)  is  equivalent  to: 


ge  >  g  Vgee0G 


Therefore,  g  occurs  at  the  lower  boundary,  g  .  Summarizing  the  results: 
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g  -  8b  for  g6s  [ga,gb]  8a  <  8b  <  0 


g  -  8a  for  g8e  [ga,gs]  0  <  8,  '  8b 


Now,  consider  Condition  (73c)  for  hQS©  and  with  ©  in  (78a) 


and  (78b)  : 


X'"H(h8'h>-0  ¥h8S9H 

l  r 


,2-  2 
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t  r 
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(89)  can  be  solved  by  direct  division: 


2 (f  -  ~) 
lx 


Since  (f  - — )  <0  ,  m„  <  0.  Now  consider  0  in  (78a).  In  this  case 

t2r  hj.2  ”  H 

h  <  0,  and  therefore  — m^  >  0.  Then  Condition  (88)  is  equivalent  to: 

l  r 

h0  i  h  Vh8£0H 

Therefore,  h  occurs  at  the  upper  boundary,  h,  .  With  ®  in  (78b),  h  >  0, 

y  D  rl 

tiZ  ^ 

and  therefore  m^  <  0.  Then  Condition  (88)  is  equivalent  to: 
r 

he  :  h  ¥1'9£eH 

Therefore  h  occurs  at  the  lower  boundary,  h  .  Summarizing  the  results: 

d 


h  =  hb  for  hg6  [ha,hb]  ha  <  hb  <  0 


(90) 


h  =  ha  for  h0€  [ha,hb]  0  <  ha  <  hb 


(91) 


(82)  and  (83) ,  (86)  and  (87)  ,  (90)  and  (91)  are  the  solutions  for  sets  (76) , 
(77)  and  (78),  respectively.  The  solution  for  sets  in  class  x  can  be  found  by 
applying  solutions  (82)  and  (83)  ,  (86)  and  (87) ,  (90)  and  (91)  one  at  a  time 
whenever  a  parameter  is  unknown.  This  completes  the  solution  of  the  maximin 
problem  for  sets  in  the  class,  x* 

We  now  wish  to  verify  the  saddle-point  condition  in  Theorem  2  by 
showing  the  relation,  (74),  holds.  It  will  be  seen  that  saddle  points  do  not 
exist  for  all  sets  in  the  class,  x*  However,  we  can  find  a  class  of  sets 
within  x  for  which  (74)  can  be  verified.  Recall  that  the  relation,  (74) ,  is 
part  of  the  saddle-point  condition,  (56),  in  view  of  the  expression  for 
J(uo,9)  in  (54).  Therefore,  to  verify  (74)  we  need  an  explicit  expression 
for  J(u  ,9)  =  J(u  ,f  ,g  ,h  ) .  Towards  this  goal  we  start  with  (54), 
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Solving,  and  after  some  rearrangement 
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We  substitute  (98)  into  (96)  and  (97),  and  then  into  (92)  to  find  an 
expression  for  J(u  ,f  ,g.,h.).  The  final  result  is 
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The  properties  of  Equation  (99)  may  be  used  to  investigate  the 
saddle-point  properties  of  the  maximin  solutions.  We  can  show  that  (99) 
is  decreasing  away  from  the  maximin  solution  locally,  so  that  there  always 
exist  some  sets  in  X  such  that  (74)  is  verified.  We  can  also  show  that  as 
the  sets  get  larger  the  saddle-point  condition  breaks  down  in  some  cases, 
since,  as  we  move  away  from  the  maximin  point,  (99)  may  start  to  increase 


again  until  its  value  becomes  greater  than  the  maximin  value. 
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To  show  that  (99)  is  decreasing  away  from  the  maximin  solution, 
we  first  note  that  since  J(uo>9)  in  (48)  is  the  optimal  cost  at  any  value 

of  9,  (99)  is  bounded  below  by  (48)  at  any  value  of  fQ,gQ  and  h„ .  From  this 

u  u  y 

fact,  and  the  fact  that  at  the  maximin  point  (99)  and  (48)  are  equal  and 
continuous,  the  derivatives  of  (99)  are  equal  to  the  derivatives  of  (48) 
at  the  maximin  point.  The  derivative  relations  (79),  (84)  and  (88)  in  view 
of  (80),  (81),  (85)  and  (89)  indicate  that  (99)  is  decreasing  as  we  move 
away  from  the  maximin  point  into  a  set  in  X.  Therefore,  there  exists  a  set 
belonging  to  X  in  which  (99)  is  decreasing  in  all  directions  from  the 
maximin  point  into  the  set.  Since  (99)  is  continuous  in  this  set,  (74) 
is  satisfied,  and  therefore  this  is  a  set  in  X  such  that  a  saddle  point 
exists . 

We  have  shown  that  there  exist  sets  in  x  such  that  the  maximin 
solution  satisfies  the  saddle-point  condition.  In  the  following,  we  show 
that  for  a  given  maximin  solution,  a  largest  set  in  the  class,  x»  can  be 
found  such  that  the  saddle-point  condition  holds.  This  set  is  found  by 
examining  the  topology  of  the  level  surface, 

J(uo,fe’g9,he^  =  J(u0>f>s>h)  ■  (ioo) 

We  show  that  this  equation  describes  the  boundary  of  the  largest  convex 
compact  set  such  that  the  saddle-point  condition  holds.  However,  this 
set  does  not  belong  to  the  class,  X,  under  consideration.  Therefore,  we 
find  the  largest  set  in  X  that  is  contained  in  this  largest  convex  set. 

We  find  that  there  are  never  any  limits  to  the  size  of  ©„,  but  there  are 

r 

always  limits  on  the  size  of  &r  and  0  . 


We  start  with  Equation  (100)  with  J(u  ,f,g,h)  given  by  (49): 
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Using  (99) ,  and  cancelling  out  the  common  denominator  on  both  sides  we 
the  result: 
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( 103e) 


(103f) 


(103g) 
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Equation  (102)  with  (103)  describes  a  surface  in  ®  ,  part  of  which  describes 
the  boundary  of  the  largest  convex  set  such  that  the  saddle-point  relations 
(74),  holds.  This  surface,  of  course,  contains  the  point  (f,g,h)  and  turns 
out  to  be  hyperboloid  in  shape,  although  Equation  (102)  is  not  quite 
quadratic.  To  describe  this  surface  precisely,  we  will  investigate  the 
contours  of  (102)  in  the  f  -g.  plane,  the  fQ-h^  plane,  and  the  g„-h,.  plane 
at  values  h.  1  h,  g.  -  g  and  fa  =  fs  respectively. 

We  start  by  investigating  the  contour  in  the  fg-gg  plane  with 
h  =  h.  This  situation  is  equivalent  to  taking  ©„  as  in  (78c).  Equation 

j  a 

(102)  then  reduces  to  a  quadratic  equation  of  the  form: 

2  2  °  2 
V9  +  Ve  +  K3hfe89  +  K4hg9  +  K680  +  K5h9  +  K7  =  0  *  (104) 


A  typical  graph  of  Equation  (104)  is  shown  in  Figure  15.  This  graph 
exhibits  three  basic  characteristics  we  need  in  order  to  construct  the 
largest  set  in  x  that  enjoys  the  saddle-point  property.  These 
characteristics  are: 

(la)  The  graph  of  (104)  is  a  hyperbola. 

(lb)  Curve  A  intersects  the  line  f4  =  f  at  exactly  one  other  point,  ga  =  STnax> 

where  is  always  in  a  set  ©r  in  x. 

TudX  U 
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(lc)  Curve  A  never  intersects  the  line,  gg  =  g,  at  a  point  other  than  the 
maximin  point,  (f,g,h). 

The  shaded  area  to  the  left  of  Curve  A  is  a  convex  set  in  the 
f0-gQ  plane,  and  is  the  largest  set  that  enjoys  the  saddle-point  property, 
since  we  have  already  shown  that  J(u  ,fa,gQ,ha)  is  decreasing  at  (f,g,h) 
in  all  directions  into  the  area  with  darker  shading.  However,  this  set 
is  not  rectangular  and  therefore  is  not  in  x» 

Characteristics  (la) ,  (lb)  and  (Ic)  enable  us  to  construct  the 
largest  set  in  x  that  enjoys  the  saddle-point  property.  This  set  is  shown 
in  Figure  15  with  darker  shading. 

It  remains  to  show  that  Characteristics  (la)  ,  (lb)  and  (Ic)  are 
true.  Necessary  and  sufficient  conditions  for  (la),  (lb)  and  (Ic)  to  be 
true  are: 

(11)  The  graph  of  Equation  (104)  is  either  a  hyperbola  or  an  ellipse. 

(12)  There  exist  two  asymptotes  of  J(uq , f ^ ,g0 ,h)  in  the  f0-gg  plane. 

They  are  a  pair  of  intersecting  lines. 

(13)  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such 
that  J(uQ,f,ge,h)  *  J(uQ,f ,g,h) . 

(14)  This  point  is  always  in  some  set  0  of  the  class,  x>  under  consideration, 
and  is  the  point  (f,gmax,h)  as  represented  in  Figure  15. 

(15)  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such  that 
J^uo’fe’8,h)  =  J(u0>f*8>h). 

This  point  is  always  on  the  opposite  side  of  the  asymptotes  of 
J(uq , f g ,gg ,h) ,  and  is  the  point  (fx>g,h)  on  Curve  B  in  Figure  15. 


(16) 
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Conditions  (II),  (12),  (15)  and  (16)  verify  Characteristic  (la). 

Conditions  (13)  and  (14)  verify  Characteristic  (lb) ,  and  Conditions  (12) , 

(15)  and  (16)  verify  Characteristic  (Ic) .  We  now  show  that  Conditions 

(II)  through  (16)  are  valid  and  discuss  the  precise  reason  why  they  prove 

that  Characteristics  (la) ,  (lb)  and  (Ic)  are  valid. 

Since  Equation  (104)  is  quadratic  in  f  and  gQ,  and  since  ^  0, 

Kc  ^  0,  K,  i  0  and  K.,  #  0,  its  graph  is  either  a  hyperbola  or  an  ellipse. 

5  6  7 

Therefore,  Condition  (II)  holds. 

From  Equation  (99)  we  see  that  the  denominator  equals  zero  for 
certain  values  of  fQ,g.  and  hQ.  Setting  the  first  factor  equal  to  zero  we 
find  that  an  asymptote  occurs  at 

f  +  f  -  •S-E-  -  — ~  =  0  .  (105) 

0  w  „2_ 


The  graph  of  (105)  in  the  f0-g0  plane  is  a  vertical  line.  Setting  the 
second  factor  in  (99)  equal  to  zero  we  find  that  another  asymptote  occurs  at 
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(106) 


The  graph  of  (106)  with  h0  =  h  in  the  f0-g0  plane  is  a  line.  Since  the  slope 
of  (106)  is  different  than  that  for  (105),  these  lines  intersect  at  some  point. 
Since  the  numerator  of  Equation  (99)  is  nonzero  for  values  of  f0  and  g0 
satisfying  (105)  and  (106),  Condition  (12)  holds. 

Equation  (104)  evaluated  at  fQ  =  f  is  quadratic  in  gQ,  and 
therefore  has  two  solutions.  One  solution  is  g0  =  g.  Since  this  solution 
is  real,  and  the  coefficients  of  (104)  are  real,  the  other  solution  is  also 
real,  and  Condition  (13)  holds. 


Equation  (99)  evaluated  at  fQ  =  f  and  h^  =■  h  is  of  the  form: 


J(uo,f,Sa,h)  = 
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(108c) 


(108d) 


We  now  show  that  Condition  (14)  holds.  We  consider  the  cases, 
g  <  0  corresponding  to  <=>  in  (77a)  and  g  >  0  corresponding  to  ©  in  (77b), 
separately. 

Case  (1):  g  <  0  ,  ©G  =  (gQe  ^ sa’ Sb  1  Sa  <  Sb  <  01 

We  have  shown  that  the  maximin  point  g  equals  g^  in  this  case.  Therefore, 

to  prove  the  validity  of  Condition  (14)  in  this  case,  it  is  necessary  to 

show  that  there  exists  a  g  6  (~°°,g)  such  that  J(u  ,f,g  ,h)  =  J(u  ,f,g,h). 

max  o  °max  o  ° 


First,  note  that  from  Equation  (84)  and  the  subsequent  discussion  we  have  shown 
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Therefore . 


+  ((f  .  si  .  !&,  >0 

w  lx  u  lx 

u 


(113) 


for  all  (f,g,h)e]R  with  f  #  0,  g  #  0,  or  h  #  0«  Using  (113)  in  (110)  we 


have  the  result 


g  -  gd  <  0 


(114) 


since  g  <  0.  Therefore,  J(u  ,f,gQ,h)  is  continuous  over  (-®,g).  Now  we  take 

O  O 

the  limit  of  J(u  ,f,g  ,h)  as  gQ-*--°°.  Using  Equation  (107), 

O  U  U 

IT  +  S  + 

lim  J(u  ,f,g  ,h)  =  lim  - rr - 

§0  ~°°  g0  -  -•  (f  +  A)(jp  +  G4) 


=  llm  (f  +  A)G. 

e  _  x?  4 

°9 


Using  the  definition  of  A  in  (101)  we  note  that 


f  +  A  =  f  -  +  f  -  <  0 

wu  *2r 


(115) 


2  2 

since  we  know  f  -  <  0  and  f  -  — <  0.  Therefore,  using  Equations  (108c) 

u  Ur 

and  (108d) ,  we  note 


(f  +  A) 


q-  >  0  for  g  •  0  and  g0  €  (-°°,g) 
4 


Therefore 


lim  J(uQ,f ,gg,h)  *  +  *  .  (116 

2  -*■  —  co 

se 

Since  J(u  ,f,gQ,h)  is  increasing  and  continuous  at  (f,g,h),  then 
o  0 

J(u  , f,g  ,h)  <  J(u  ,f,g,h)  for  some  neighborhood  (g  -  e,g),  e  >0.  Since 
o  y  o 

J (u  ,f,gQ,h)  is  continuous  over  (— 00 , g)  and  has  limit  +  »  as  g.  •+■-»,  there 

o  y  y 

exists  some  point  gmax€  (~“»g)  such  that  J(uo,f .g^.h)  =  J(uQ,f,g,h). 
Therefore,  Condition  (14)  is  true  for  g  <  0  and  with  ©G  given  as  in  (77a). 

Case  (2):  g  >  0  ,  ©G  =  {gQ€  [ga»gbl  0  <  8a  <  *b>  . 

We  have  shown  that  the  maximin  point  g  equals  g  in  this  case.  Therefore, 

3 

to  prove  Condition  (14)  in  this  case,  it  is  necessary  to  show  that  there 

exists  a  g  £  (g,”)  such  that  J(u  ,f,g  ,h)  -  J(u  ,f,g,h).  First,  note 
max  o  max  o 

that  from  Equation  (84)  and  the  subsequent  discussion  we  have  shown  that 

J(uQ,f,gg,h)  is  decreasing  at  (f,g,h).  Equation  (109)  gives  the  only  point, 

g , ,  where  J(u  ,f,g  ,h)  is  discontinuous.  Using  the  expression  for  g  -  g , 
a  o  y  q 

given  in  Equation  (110) ,  and  the  relation,  (113),  we  obtain  the  result 


g  -  gd  >  0 


since  g  >  0.  Therefore  J(u  ,f,g.,h)  is  continuous  over  (g,°°) 

o  y 


the  limit  of  J(u  ,f,gQ,h)  as  g. -*•<».  Using  Equation  (107), 

o  y  y 


gQ  +  C:  +  G3gQ 

lim  J(u  ,f,g  ,h)  =  lim  - rr - 

80  "  80""  (f  +  A)(^  +  G4) 


’6 


lim 


G3ge 


n  co 


(f  +  A)G. 


Now  we  take 
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lim  J(u  ,f,g  ,h)  =  +  *  (117) 

2  -V  co 

G3  8e 

Since  (T+  A)'  >  0  and  g“  >  0  for  g  >  o  and  gQ€  (g,»).  Since  J(uo,f,g9,h) 

is  decreasing  and  continuous  at  (f,g,h),  then  J(uo,f,ge,h)  <  J(u  ,f,g,h)  for 

some  neighborhood  (g,g  +  c)  ,  e  >0.  Since  J(u  »f,gQ,h)  is  continuous  over 

o  o 

(g,00)  $  and  has  limit  +  00  as  gQ->-oo>  there  exists  some  point  gmaxe  (g*°°)  such 
that  J(uQ>f »8max»h)  =  J(uo,f9g,h).  Therefore,  Condition  (14)  holds  for 
g  >  0  and  ©G  given  in  (77b). 

We  now  compute  g  using  Equation  (107)  with  (108) . 

LUciX 

J(Vf’gmaxsh)  *  J<Vf’g’h) 

G  +  *  G38L  G1  +  <=;*  +  ^ 

(f  +  A)(fA  +  G^^)  (f  +  A)(fA  +  G4g)  • 

Cross-multiplying,  and  cancelling  the  term,  f  +  A,  we  obtain 

<G1  +  G2*W  +  °3*Lx>  CtA  +  G4e)  ’  (G1  +  G28  +  °3e2)  (M  +  G^)  . 

By  expanding  this  and  factoring  out  the  already  known  factor,  g  ~  g, 
o  max 

<SW  -  S>tG3£A(W  +  8)  +  G3G46  *  G2fA  -  GjGj]  -  0  . 

Setting  the  second  factor  equal  to  zero,  and  solving  for  g 

max 

[G3fAg  +  G2fA  -  GjGJ 
Smax  *  G3(fA  +  G^g) 

Using  the  definitions  of  A  in  (101)  and  G^  through  G^  in  (108) ,  and  using  a 
few  manipulations,  we  obtain  the  final  result: 
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g. 


.  gf(f .  j&  .  s!%+  (f2  +  k!*2 

Hr  w  lx 
X 


w 


u 


.  2 

w  k  q 

2  }  +  ~iPE  (f 
Hr 


2  2 

g  P  hi.  2 

w  0  ' 

u 


H2r 


max 


If(f .  A.hi,  +  £$?t 

u  lx  w  l  r 


(118) 


Note  (118)  represents  g  for  both  Case  (1)  and  Case  (2) .  This  completes 

ulaA 

the  consideration  of  Condition  (14) . 

Equation  (104)  evaluated  at  g  =  g  is  quadratic  in  fQ ,  and 
therefore  has  two  solutions.  One  solution  is  fa  ■  f.  Since  this  solution 
is  real,  and  the  coefficients  of  (104)  are  real,  the  other  solution  is  also 
real,  and  Condition  (15)  is  true. 

We  now  show  that  Condition  (16)  is  true.  We  start  by  investigating 

the  asymptote  equations,  (105)  and  (106) .  Denote  fQ  in  (105)  by  f  ... .  Then, 

y  a± 

using  (105) 


2  2 

,  +  f  _£JL_k _l 

“dl  w  2 

u  Hr 


(119) 


Substracting  (119)  from  Relation  (115)  we  get  the  result 


f  <  f 


dl 


for  all  g0e0G  »  hg  *=  ®h 


(120) 


Denote  ffl  in  (106)  by  f^*  Then  using  (106) 


gQgPh0hZ 


w  H2r 
u 


d2 


2  2 

_  5_p  .  o* 

w 

u 


(121) 


-  (f  -  “  -  -V1) 

Hr 


We  now  show  that  f  >  f.  Comparing  f,„  to  f  and  using  (121), 


r 


iwf.v 
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gflgPhfihr 
i  -  ,e  +  t(f 


si 


h2E 


d2 


f  = 


w  £2 r 
u 


w 


*2r 


)] 


-  (f 


2  2 
.  o.  _  4 


w 


£  2r 


t^P^E  +  f  (f  _  Z2.  _  hi)  ] 

w  1  r  Wu  l  r 

f.~  -  f  >  ~ H - - - - -  for  all  gQ€0G  ,  hQ€©H 


d2 


-  (f 


2  2 

.  SJL  _  h_I) 


w  „2 

u  £  r 


2  2 

since  g  g  >  g  and  hnh  >  h  for  all  g  €  ©,,  and  h  £©„.  Therefore,  since 

2  2  “  au  OH 

f  -  <  0  and  Relation  (113)  holds,  f -  f  >  0.  That  is, 


w  02 

u  £  r 


f  <  f 


d2 


for  all  g0eeG  ,  h0  e  ©H 


(122) 


Equation  (121)  evaluated  at  h0  =  h  has  negative  slope  in  the  fg-g0  plane 


when  g  <  0  and  ©^  as  in  (77a),  and  positive  slope  in  the  f0-g0  plane  if 


g  >  0  and  ©G  as  in  (77b) .  The  graphs  of  (119)  and  (121)  are  a  pair  of 
intersecting  lines  oriented  as  shown  in  Figure  15.  From  Figure  15  we  can  see 


that  fd2  >  fdl  for  some  gQ  <  g,  when  g  <  0  or  for  some  g0  >  g  when  g  >  0. 


„+ 


We  will  take  the  limit  of  J(u  , f  ,g  ,h)  as  f  -*  f  .  at  a  value  of  gQ  such 

o  y  y  y  y 


that  f d2  >  f^^j  as  shown  in  Figure  15.  Examination  of  (99)  shows  that 


lim, 


lim,  J(u  ,f  ,g  ,h)  =  - , 

f  *  f+  0  0  6  f  ^  f+ 

0  d2  9  d2 


-1 


gnSh2^ 

(f0  +  A)(f  ,A  +  —  2  ) 


w  Z  r 
u 


22422  g2P2h2E2w  g2  f2g2P2h2E2  w  k2q 
g  P  h  E  k  q  _  xe9  .  86  .  x  M.2 


2w  2  4 r2 
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2  2 

2w  fr 
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2w  l2r 
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(123) 
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■  llh*.  A*'.  ... ■  tW. A.  i  fci  *u~ti  I id  i  flbil /Vl  <Vi  V)  Vi  Vi  I’kiihTuVUli  >  iW*>  A  A  1 


■4 

n 


“1 


.•1 


since  the  fa  term  and  the  gQhQ  term  (evaluated  at  hQ  =  h)  cancel  when 

0  0  0  y 

fQ  =  Now  since  we  are  operating  at  fQ's  that  are  to  the  right  of  the 

asymptotes  given  by  (119)  and  (121) ,  the  first  term  of  the  denominator  of 
the  right  side  of  (123)  is  positive,  where  before  it  was  negative,  and  the 
second  term  of  the  denominator  is  negative,  where  before  it  was  positive. 
Therefore,  the  denominator  is  negative,  and  inspection  of  (123)  shows  that 

the  numerator  is  also  negative,  so  that  J(u  , f.,g.,h)  is  positive  for  values 

o  y  y 

of  f  close  to  ft,.  Therefore, 

0  al 


lim+  J(uo,fQ,g0,h)  =  + 
f  6  -  fdl 


(124) 


Now  consider  the  portions  of  the  lines  (119)  and  (121)  that  are  right-most 

which  we  have  labeled  f,  in  Figure  15.  Since  J(u  ,fQ,g. ,h)  =  +  °°  on  the  line 

a  o  y  y 

segment  corresponding  to  Equation  (121),  and  J(u  ,f,,g.,h)  is  continuous  for 

o  y  y 

fQ  >  f,,  J(u  ,fQ,g.,h)  =  +  ”  on  the  line  segment  corresponding  to  (119). 

y  a  o  y  y 

Therefore, 


lim+  J (uq , f  q  , gg  ,h)  =  +  °°  for  all  gQ  S  ]R  .  (125) 

Vfd 


We  now  take  the  limit  of  J(u  ,fQ,g,h)  as  f+°°.  Equation  (99)  evaluated  at 

O  0  0 

g9  =  g  and  hQ  =  h  is  of  the  form: 


J(uo»fe>g’h) 


F  +  F  f  +  F  f 
1  2  9  3  8 

2  2 

(fa  +  A)(f,A  +  V  ") 


'0 


2 

w  l  r 
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(126) 


where 
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Taking  the  limit  of  J(u  ,f.,g,h)  using  (126), 

o  y 


lim  J(u  ,f  ,g,h) 

Cm  O  9 

f  .  -V  00 


F  F 
1  2 
— i  +  —  +  F 

2  f  3 

2  2 

>  A  o  Phi 

(i  +  -—ha  +  s  n  ;-) 

0  f  w  £" r 

y  u 


Using  the  definition  of  F^  in  (127c)  and  A  in  (101), 


2  2  2  2 
-PIT 


2wu£2r 

lim  J(u  ,f  ,g,h)  =  - ; - 5 — 

f  +co  °  0  g2p  h2r 

u  £  r 


We  now  compare  this  value  of  J(uo,f^,g,h) 
by  Equation  (49)* 


with  the  maximin  value  given 


f4  =  °° 


J(u  ,f,g,h)  -  J(u  ,f  ,g,h)  >  0  .  (129) 

O  O  u  - 

V” 

But  from  Equation  (125)  we  know 

J(u  ,f,g,h)  -  J(u  ,f  ,g,h)  ,  <  0  .  (130) 

O  O  0  c  _  c* 

0  =  d 

Therefore,  since  J(u  ,fQ,g,h)  is  continuous  over  (f,,°°),  there  exists  some 

o  a 

point  fQ  =  fx  in  the  interval  (f^,00)  such  that 
J(uQ,f ,g,h)  -  J(uQ,fx,g,h)  =  0 
Therefore,  Condition  (16)  holds. 

Condition  (16),  supported  by  Condition  (15),  shows  that  the  graph 
(104)  is  not  an  ellipse  since  the  graph  of  (104)  would  be  required  to  cross 
the  asymptotes  that  exist  due  to  Condition  (12).  Therefore,  by  Condition  (II) 
we  know  that  the  graph  is  a  hyperbola.  Therefore,  Characteristic  (la)  is  true. 
Conditions  (13)  and  (14)  are  equivalent  to  Characteristic  (lb) ,  and  Conditions 
(15)  and  (16),  supported  by  Condition  (12),  are  equivalent  to  Characteristic 
(Ic) .  This  completes  the  proof  of  the  validity  of  Characteristics  (la),  (lb) 


and  (Ic) . 
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Characteristics  (la),  (lb)  and  (Ic)  enable  us  to  construct  the 
largest  set  in  x  with  ©^  given  in  (78c) .  This  set  is  shown  in  Figure  15  as 
the  area  with  the  darker  shading. 

Next  we  investigate  the  contour  in  the  t  -  hQ  plane  with  gc  =  g. 

y  y  y 

This  situation  is  equivalent  to  setting  ©.,  as  in  (77c).  Equation  (102)  then 

(j 

reduces  to  a  quadratic  equation  of  the  form: 

Kife  +  K2fe  +  K38fehe  +  K4ghe  +  K5he  +  K6g2  +  K7  =  0  •  <131> 

A  typical  graph  of  Equation  (104)  is  shown  in  Figure  16.  This  graph  exhibits 
three  basic  characteristics  we  need  in  order  to  construct  the  largest  set  in 
X  that  enjoys  the  saddle-point  property.  These  characteristics  are: 

(TIa)  The  graph  of  (131)  is  a  hyperbola. 

(lib)  Curve  C  intersects  the  line  f  «  f  at  exactly  one  other  point,  hA  *  h  , 

0  7  9  max 

where  h  is  always  in  a  set  ©u  in  x- 

IH3.X  n 

(lie)  Curve  C  never  intersects  the  line,  hQ  =  h, at  a  point  other  than  the 

y 

maximin  point,  (f,g,h). 

We  see  that,  as  in  the  previous  case,  the  shaded  area  to  the  left  of 

o 

Curve  C  is  a  convex  set  in  the  fg  -  hg  plane,  and  is  the  largest  set  that 

enjoys  the  saddle-point  property  since  we  have  already  shown  that  J(u  ,fQ,gn,h^) 

o  y  y  y 

is  decreasing  at  (f,g,h)  in  all  directions  into  the  area  with  darker  shading. 
However,  this  set  is  not  rectangular  and  therefore  is  not  in  x- 

Characteristics  (Ila) ,  (lib)  and  (lie)  enable  us  to  construct  the 
largest  set  in  x  that  enjoys  the  saddle-point  property.  This  set  is  shown 
in  Figure  16  with  darker  shading. 
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It  remain?  to  show  that  Characteristics  (Ila)  ,  (lib)  and  (He)  are 
valid.  Necessary  and  sufficient  conditions  for  (Ila) ,  (lib)  and  (lie)  to  be 
true  are : 

(111)  The  graph  of  Equation  (131)  is  either  a  hyperbola  or  an  ellipse. 

(112)  There  exist  two  asymptotes  of  J(u  ,fQ,g,hQ)  in  the  f.  -  hQ  plane. 

They  are  a  pair  of  intersecting  lines. 

(113)  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such  that 

J (u0* f »g»hg)  *  J (u0» f  »g»h) • 

(114)  This  point  is  always  in  some  set  ©  of  the  class,  x>  under  consideration, 

ri 

and  is  the  point,  (f,g,hmay), represented  in  Figure  16. 

(115)  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such  that 
J(uo,f0’g,h)  =  J(u0>f.s*h). 

(116)  This  point  is  always  on  the  opposite  side  of  the  asymptotes  of 

J(u  , fQ,g,h  ),  and  is  the  point  (f  ,g,h)  on  Curve  D  in  Figure  16. 
o  y  y  x 

Conditions  (III),  (112)  ,  (115)  and  (116)  verify  Characteristic  (Ila). 
Conditions  (113)  and  (114)  verify  Characteristic  (lib) ,  and  Conditions  (112) , 
(115)  and  (116)  verify  Characteristic  (lie).  We  now  show  that  Conditions  (III) 
through  (116)  are  valid,  and  discuss  the  precise  reason  why  they  prove  that 
Characteristics  (Ila),  (lib)  and  (lie)  are  valid. 

Since  Equation  (131)  is  quadratic  in  and  h?,  and  since  f  0, 

^  0,  ^  0  and  t  0,  its  graph  is  either  a  hyperbola  or  an  ellipse. 

Therefore,  Condition  (III)  holds. 

To  prove  Condition  ( I 12 )  we  again  investigate  the  denominator  of 
(99)  by  setting  each  factor  equal  to  zero.  From  Equation  (105)  we  see  that 

one  asymptote  is  a  vertical  line  in  the  f  -  hn  plane.  From  Equation  (106) 

y  y 
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we  see  that  with  gQ  =  g,  its  graph  is  a  line  in  the  fQ  -  h0  plane.  Since  the 
slope  of  (106)  is  different  than  that  of  (105)  ,  these  lines  intersect  at 
some  point.  Since  the  numerator  of  (99)  is  nonzero  for  values  of  fn  and  hn 
satisfying  (105)  and  (106),  Condition  (112)  holds. 

Equation  (131)  evaluated  at  fQ  »  f  is  quadratic  in  hQ ,  and  therefore, 
has  two  solutions.  One  solution  is  hQ  =  h.  Since  this  solution  is  real,  and 
the  coefficients  of  (131)  are  real,  the  other  solution  is  also  real,  and 
Condition  (113)  indeed  holds. 

Equation  (99)  evaluated  at  f  =  f  and  g  =  g  is  of  the  form: 

0  0 


H  +  H  h  +  Hh 
J(uo,f,g,hQ)  (f  +  A)(fA  + 


(132) 


where 
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(133d) 


We  now  show  that  Condition  (114)  holds.  We  consider  the  cases. 


h  <  0  corresponding  to  ©  in  (78a)  and  h>  0  corresponding  to  0  in  (78b) 

”  H 


separately. 
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Case  (1) :  h  <  0  ®R  =  {  hQ  €  [ha,hb]  ha  <  <  0} 

We  have  shown  that  the  minimax  point,  h,  equals  h^  in  this  case.  Therefore, 
to  prove  the  validity  of  Condition  (114)  in  this  case,  it  is  necessary  to 

show  that  there  exists  a  h _ €  (-»  h)  such  that  J(u  »f,g,h  )  =  J(u  ,f,g,h). 

First,  note  that  from  Equation  (88)  and  the  subsequent  discussion,  we  have 
shown  that  J(uo,f,g,hg)  is  increasing  at  (f,g,h).  Now  consider  Equation  (106), 
evaluated  at  f  =  f  and  g  =  g,  which  yields  a  value  of  h,  which  we  denote 

y  U  U 


2  2 

. 

w  o  2  ; 

u  JL  r 


(134) 


w  l2r 
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(f*gshd)  is  the  only  point  where  J(uo,f,g,hQ)  is  discontinuous  in  hQ.  We  show 
that  h  <  hd  so  that  J(uo,f,g,hQ)  is  continuous  over  (-<*>,  h) .  Note  that 


h  -  h ,  = 
a 


2  2  2  2  " 
8-Jh-i  +  f  (f  _  &-L  . 

02  (  W  2  ' 

w  Z  r  u  l  r 
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Using  Equation  (113)  and  the  fact  that  h  <  0  we  have  the  result 


h  -  h  <  0 
a 


(136) 


Therefore,  J(uQ,f,g,hg)  is  continuous  over  (-<*>, h)  .  Now  we  take  the  limit 
of  J(u  ,f,g,h  )  as  h  -*>-<».  Using  Equation  (132), 


h7  +  H2  +  H3h0  H.ha 

lim  J(u  ,f,g,h  )  =*  lim  - — -  =  lim  TT^T'aMT  • 

h»  —  °  6  h.—  (£  +  *)#  +  H.)  h*-»  <f+A)H4 

w  y  hA  4  n 


Using  Relation  (115)  and  the  definitions  of  in  (133c)  and  in  (133d)  we 
note 


and 


Therefore, 


(f  +  A)  * 


0 


for  h  <  0  and  hQ  €  (-°°,h) 


lim  J(u  ,f,g,h  )  =  +  ®  .  (137 

h  -*■—  °  9 

0 

Since  J(u  ,f,g,hQ)  is  increasing  and  continuous  at  (f,g,h),  then 
o  y 

J(u  ,f ,g,h  )  <  J(u  ,f,g,h)  for  some  neighborhood  (h  -  e,h),  e  >  0.  Since 
o  y  o 

J (u  ,f ,g,h  )  is  continuous  over  (-°°,h),  and  has  limit  +  °°  as  hQ-*--°°,  there 

O  0  D 

exists  some  point  h  €(-00,h)  such  that  J(u  ,f,g,h  )  =  J(u  ,f,g,h). 
r  max  o  °  max  o 

Therefore,  Condition  (114)  is  true  for  h  <  0  and  with  given  as  in  (78a). 

Case  (2):  h  >  0  €>  =  (hQ€  [h  ,h,  ]  0  <  h  <  h,  } 

H  9  a  b  a  b 

We  have  shown  that  the  maximin  point,  h,  equals  h  in  this  case.  Therefore, 

to  prove  Condition  (114)  in  this  case,  it  is  necessary  to  show  that  there 

exists  an  h  G(h,°°)  such  that  J(u  ,f,g,h  )  =  J(u  ,f,g,h).  First,  note 
max  o  max  o 

that  from  Equation  (88)  and  the  subsequent  discussion,  we  have  shown  that 

J(u  ,f,g,hQ)  is  decreasing  at  (f,g,h).  Equation  (134)  gives  the  only  point, 

h, ,  where  J(u  ,f,g,hQ)  is  discontinuous.  Using  the  expression  for  h  -  h  , 
u  o  w  a 

given  in  Equation  (135),  the  Relation  (113),  and  the  fact  that  h  >  0,  we 


obtain  the  result : 


Therefore,  J(u  ,f,g,h.)  is  continuous  over  (h , ot>) »  Now  we  take  the  limit  of 
o  o 

J(u  ,f,g,h.)  as  h.-*-00.  Using  Equation  (132), 

O  U  U 


lim 

h  -*■  00 

0 


J  (u 

o 


f’g’V 


lim 

h  “►  00 

0 


S  +  H3he 

(f  +  A)  <|*  +  K4, 


lim 

h  ■+■ °° 

e 


H3h8 

(f  +  A)H4 


Therefore, 


lim 

V" 


J(uQ,f ,g,h0) 


+  OO 


(138) 


H3  hQ 

since  >  0  and  —  >  0  for  h  >  0  and  h0  S  (h,°°).  Since  J(uQ,f,g,h0)  is 

w  4 

decreasing  and  continuous  at  (f,g,h),  then  J(u  ,f,g,hA)  <  J(u  ,f,g,h)  for 

o  y  o 

some  neighborhood  (h,h  +  e) ,  e  >0.  Since  J (u  ,f,g,hQ)  is  continuous  over 

o  y 

(h ,°°) ,  and  has  limit  +  00  as  ha  -*•«,  there  exists  some  point  h  e(h,°°)  such 

0  max 

that  J(u  ,f,g,h  )  =  J(u  ,f,g,h).  Therefore,  Condition  (114)  holds  for 
o  max  o 

h  >  0  and  given  by  (78b) . 

We  now  compute  h  using  Equation  (132)  with  (133) . 
max 


J(u 

o 


f’8’hmax) 


J(uQ,f ,g,h) 


H.  +  H.h  +  H.h 
1  2  max  3  max 

(f  +  A) (fA  +  H.h  ) 
4  max 


Hx  +  H2h  +  H3h2 
(f  +  A)  (fA  +  H^h) 


Cross-multiplying,  and  cancelling  the  term,  f  +  A,  we  obtain 

(H  +  H.h  +  H.h2  )(fA  +  H.h)  =  (H7  +  H.h  +  H,h2)(fA  +  H.  h  ) 
1  2  max  j  max  4  1  2  3  4  max 


max 


■h, 


By  expanding  this  and  factoring  out  the  already  known  factor,  h 


(h  -  h) [H„fA(h  +  h)  +  H-H.hh  +  H_f A  -  H.H.  =  0 
max  3  max  3  4  max  2  14 


Setting  the  second  factor  equal  to  zero,  and  solving  for  h  , 

QlaX 


-  [H^f Ah  +  H2fA  -  HjHJ 
H3(fA  +  H4h) 


Using  the  definitions  of  A  in  (101) ,  and  through  in  (133) ,  and  using  a 
few  manipulations,  we  obtain  the  final  result: 

2  2 

o  5  2  g  y  w  r  2  2 

ur/r  o  _  a  x  8  PhZ  (f2  M  M  X  _  h_Z,2 


[f(f  -  -  V)  +  SlJy1 

u  l  r  w  l  r 
u 


(139) 


Note  that  (139)  represents  h  for  both  Case  (1)  and  Case  (2) .  This  completes 

max 

the  consideration  of  Condition  (114) . 

Note  that  Conditions  (115)  and  (116)  are  equivalent  to  Conditions 
(15)  and  (16)  which  we  have  shown  to  be  true.  Therefore,  Conditions  (115) 
and  (116)  hold. 

As  in  the  previous  case,  Condition  (116) ,  supported  by  Condition 
(115)  ,  shows  that  the  graph  of  (131)  is  not  an  ellipse  since  the  graph  of 
(131)  would  be  required  to  cross  the  asymptotes  that  exist  due  to  Condition 
(112) .  Therefore,  by  Condition  (III),  we  know  that  the  graph  is  a  hyperbola. 
Therefore,  Characteristic  (Ha)  is  true.  Conditions  (113)  and  (114)  are 
equivalent  to  Characteristic  (Hb)  ,  and  Conditions  (115)  and  (116)  ,  supported 
by  Condition  (112) ,  are  equivalent  to  Characteristic  (lie).  This  completes 
the  proof  of  the  validity  of  Characteristics  (Ila) ,  (lib)  and  (lie) . 
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Characteristics  (Ila),  (lib)  and  (lie)  enable  us  to  construct  the 
largest  set  in  x  with  given  in  (77c).  This  set  is  shown  in  Figure  16  as 
the  area  with  the  darker  shading. 

Finally,  we  investigate  the  contour  in  the  ga  -  hQ  plane  with 

D  O 

fQ  =  f.  This  situation  is  equivalent  to  taking  ©F  as  in  (76c).  Equation 
(102)  then  reduces  to  a  quadratic  equation  of  the  form: 

K689  +  (K3f  +  K4)g0h0  +  K5h0  +  Klf2  +  K2f  +  K7  =  0  *  (140) 

A  typical  graph  of  Equation  (140)  is  shown  in  Figure  17.  This  graph  exhibits 
three  basic  characteristics  we  need  in  order  to  construct  the  largest  set  in 
X  that  enjoys  the  saddle-point  property.  These  characteristics  are: 

(Ilia)  The  graph  of  (140)  is  a  hyperbola. 

(Illb)  Curve  E  intersects  the  line,  hQ  =  h,  at  exactly  one  other  point 

gfl  =  where  is  always  in  a  set  ©  in  x- 

n  max  max  C 

(IIIc)  Curve  E  intersects  the  line,  g^  =  g,  at  exactly  one  other  point 

h„  =  h  where  h  is  alwavs  in  a  set  ©„  in  x* 

9  max  max  H 

The  shaded  area  to  the  left  of  Curve  E  is  a  convex  set  in  the 
8q  ~  hg  plane,  and  is  the  largest  set  that  enjoys  the  saddle-point  property, 

since  we  have  already  shown  that  J(u  »fQ,gQ»hQ)  is  decreasing  at  (f,g,h)  in 

O  b  y  U 

all  directions  into  the  area  with  darker  shading.  However,  this  set  is  not 
rectangular  and  therefore  is  not  in  x* 

Characteristics  (Ilia),  (Illb)  and  (IIIc)  enable  us  to  construct 
the  largest  set  in  x  that  enjoys  the  saddle-point  property.  This  set  is 
shown  in  Figure  17  with  darker  shading. 
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It  remains  to  show  that  Characteristics  (Ilia) ,  (Illb)  and  (IIIc) 
are  valid.  Necessary  and  sufficient  conditions  for  (Ilia) s  (Illb)  and  (IIIc) 
to  be  true  are: 

(III1)  The  graph  of  (140)  is  either  a  hyperbola  or  an  ellipse. 

(III2)  The  graph  of  (140)  has  center  at  the  origin. 

(III'O  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such  that 
J(uQ,f ,gQ,h)  =  J(uQ,f,g,h). 

(III4)  This  point  is  always  in  some  set  of  the  class,  x,  under 

consideration.  This  is  point  (f,g _ ,h)  as  represented  in  Figure  17. 

max 

(III5)  There  always  exists  exactly  one  other  point  besides  (f,g,h)  such 

that  J(u  ,f,g,h  )  =  J(u  ,f,g,h). 
o  0  o 

(III6)  This  point  is  always  in  some  set  of  the  class,  x»  under  considera¬ 
tion.  This  is  point  (f»S>hmax)  as  represented  in  Figure  17. 
Conditions  (III1)  through  (III6)  verify  Characteristic  (Ilia) . 
Conditions  (III3)  and  (III4)  verify  Characteristic  (Illb),  and  Conditions 
(III5)  and  (III6)  verify  Characteristic  (IIIc).  We  now  show  that  Conditions 
(III1)  through  (III6)  are  valid,  and  discuss  the  precise  reasons  why  they 
prove  that  Characteristics  (Ilia),  (Illb)  and  (IIIc)  are  valid. 

Since  Equation  (140)  is  quadratic  in  gQ  and  hQ,  and  since 

y  y 

2 

K^f  +  K?f  +  ^  0,  ^  0  and  t  0,  its  graph  is  either  a  hyperbola  or  a 

ellipse.  Therefore,  Condition  (III1)  is  true. 

Since  there  are  no  terms  in  (140)  that  are  linear  in  gQ,  and  there 

y 

are  no  terms  that  are  linear  in  h. ,  the  graph  of  Equation  (140)  has  center 

y 

at  the  origin.  Therefore,  Condition  (III2)  is  true. 
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Conditions  (III3)  and  (III4)  are  equivalent  to  Conditions  (13)  and 
(14)  which  are  known  to  be  true.  Therefore,  Conditions  (III3)  and  (III4) 
are  true. 

Conditions  (III5)  and  (III6)  are  equivalent  to  Conditions  (113) 
and  (114)  ,  which  are  known  to  be  true.  Therefore,  Conditions  (III5)  and 
(III6)  are  true. 

Conditions  (III3)  through  (III6)  establish  the  existence  of  points 

(f,g  ,h)  and  (f,g,h  )  as  solutions  of  (140).  We  also  know  that  (f,g,h) 
nicix  max 

solves  (140).  Since,  by  Condition  (III2)  ,  we  know  the  graph  has  center  at 
the  origin,  the  graph  of  (140)  cannot  be  an  ellipse  since  we  cannot  find  an 
ellipse  centered  at  the  origin  that  intersects  these  three  points,  as  we  can 
easily  see  from  Figure  17.  Therefore,  by  Condition  (III1) ,  the  graph  of 
(140)  is  a  hyperbola  and  Characteristic  (Ilia)  is  true.  Conditions  (III3) 
and  (III4)  are  equivalent  to  Characteristic  (Illb) ,  and  Conditions  (III5)  and 
(III6)  are  equivalent  to  Characteristic  (IIIc) .  This  completes  the  proof  of 
Characteristics  (Ilia),  (Illb)  and  (IIIc). 

Characteristics  (Ilia),  (Illb)  and  (IIIc)  enable  us  to  construct 
the  largest  set  in  x  with  ®F  given  in  (76c).  This  set  is  shown  as  the  area 
in  Figure  17  with  the  darker  shading. 

We  can  combine  the  results  that  were  arrived  at  through  the  study 
of  the  contours  in  the  f*  -  g.  plane,  the  fQ  -  hQ  plane,  and  the  gn  -  h„ 
plane  at  values  hn  =  h,  gQ  =  g  and  t.  =  f,  respectively,  to  find  the  largest 
set  in  the  class,  x>  when  all  three  parameters  are  unknown.  All  parameter 
sets  that  are  subsets  of  this  set  enjoy  the  saddle-point  property.  The  result 
is  summarized  in  the  following  theorem. 


Theorem  3  s  Consider  a  subclass  5  of  the  class,  x>  such  that? 


*  ■  <®  =  ®FS  U  ®GS  °  SHS} 

and  ®Fg»  ®qs8  ®HS  are  eac^  one  t*ie  following,  respectively : 


0FS  =  {f0£  ^ fa,f ^  -  00  <  fa  <  f  <  0>  (141a) 

or  0ps  =  (fQ€  [fa,f]  0  <  fa  <  f}  (141b) 

or  ©ps  =  (f0  =  f  f €  IR  f  *  0}  (141c) 

0GS  =  {ge€  [ga’g]  gmax  ±  ga  -  8  <  0}  (142a) 

°r  ©GS  -  {gQ  e  tg,gb]  0  <  §  <  8b  <  gmax>  <142b> 

or  ©GS  =  {g0  =  g  g^lR  g  +■  0J  (142c) 


®HS  -  lh8e[ha>hl  h»ax:ha;ht0)  (U3a> 

or  ®HS  '  lh8*  lh'hbJ  0  ‘  h  i  hb  -  h»ax'  (U3b) 

or  ©HS  =  (hQ  =  h  he®  h  +  0}  (143c) 

where  g  is  given  by  Equation  (118),  and  h  is  given  by  (139).  The 
max  max 

following  results  hold: 

1)  (f»g,h)  is  the  solution  to  the  maximin  problem,  (9) » for  scalar  systems  of 
the  form  described  by  (3),  (4)  and  (5)  for  ©G«s. 

2)  The  minimax  problem, (8) ,  and  the  maximin  problem, (9),  for  scalar  systems 
of  the  form  described  by  (3),  (4)  and  (5)  have  the  same  solution  if  and  only 
if  ©  belongs  to  the  class,  described  above. 

This  theorem  describes  all  the  possible  parameter  sets  in  the  class 
X  such  that  a  saddle  point  exists  at  the  maximin  solution,  (f ,g,h) .  However, 


this  theorem  does  not  say  that  there  is  no  minimax  solution  for  sets  in  x  that 
are  not  in  $,  but  only  that  the  minimax  solutions  for  these  sets  do  not 
correspond  to  the  maximin  solutions. 

This  completes  our  solution  to  the  minimax  problem  for  scalar 
systems  by  the  method  of  solving  the  maxirain  solution  for  the  same  system.  We 
now  consider  the  examples  of  the  previous  section,  and  compare  the  performance 
of  the  minimax  controller  to  that  of  the  least  sensitive  controller. 


4.4.  Examples 

The  following  scalar  system  was  considered  in  Section  3,  which  we 
state  here  using  the  present  notation  for  the  uncertain  parameters : 


dxt  -  f9xtdt  +  geutdt  +  dwt 
dyt  =  x^dt  +  dv^ 


(32a) 

(32b) 


(33) 


Here  f ^  =  9^  and  g^  =  9 In  Example  4.1,  gQ  is  known  and  equal  to  1,  and 

f„  is  unknown.  In  Example  4.2,  fQ  =  1  and  gQ  is  unknown. 

u  y  y 

For  Example  4.1,  fQ  unknown  and  gQ  =  1,  we  introduced  a  uniform 

y  y 


distribution  on  the  following  sets  in  Example  3.1  of  Chapter  3: 
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©F1  =  {f0€  [0. 9,1.1] ) 

(144a) 

©F2  =  {f0e  [0.7, 1.3]} 

(144b) 

©F3  =  {f0e[O.5,1.5]} 

(144c) 

For  Example  4.2,  f_  =  1  and  gQ  unknown,  we 

introduced  a  uniform 

distribution  on  the  following  sets  in  Example  3.2  of 

Chapter  3: 

®Gi  "  {80€  C0.9, 1.1]} 

(145a) 

eG2  =  {ge€  t0*7’1’3^ 

(145b) 

©g3  =  [g0e  [0.5, 1.5]} 

(145c) 

Equations  (47)  represent  the  controller  corresponding  to  the  maximin 

solution, (f,g) ,  which  is  yet  to  be  determined.  For 

(47)  reduce  to: 

these  examples  Equations 

uq  =  -  gp  xt 

(146a) 

dxt  =  (f  -  g2P)xtdt  +  E(dyt  -  x^dt) 

(146b) 

2fP  -  g2P2  +1=0 

(146c) 

2f£  -  r2  +  1  =  0 

(146d) 

Consider  Example  4.1  with  g0  =  g  =  1,  and  the  parameter  sets  in 
(144) .  We  can  use  the  Frechet  differentials  of  Theorem  2  to  find  the  maximin 
solution  for  each  of  these  sets.  The  result  for  each  set  is  stated  in 
Part  (1)  of  Theorem  3.  The  maximin  solutions  are: 


0F1  = 

f 

=  1.1 

(14'a 

®F2  : 

f 

=  1.3 

(  1  -  ' 

0F3  : 

f 

-1.5 
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From  Theorem  3,  Part  (2),  we  know  the  sets  ©p^,  anc*  ®f3 

enjoy  the  saddle-point  property,  and  thus  Solutions  (147)  are  minimax 

solutions,  and  the  corresponding  controllers,  (146),  are  minimax  controllers. 

We  wish  to  investigate  the  performance  of  these  minimax  controllers 

as  the  parameter,  f^,  varies  in  the  sets  ©F^,  ®P2  and  ®F3*  To  facilitate 

this,  we  define  J(u  ,f.,gQ,l)  in  Equation  (99)  by  the  notation  J„w(fQ,gQ). 

We  use  Equation  (99)  to  calculate  J._,  for  each  example.  Then  the  performance 

MM 

measure  used  in  Chapter  3  is  used,  that  is,  a  graph  of  -  JQpT  as  fQ  or  gQ 
varies  in  their  respective  sets. 

To  evaluate  Equation  (99)  for  a  particular  value  of  f.  Equations 
(146c)  and  (146d)  must  be  solved  to  find  the  corresponding  P  and  E: 


P  *  f  +  A2  +  1 

I  -  f  +  A2  +  1 


(148a 

(148b 


Therefore,  for  f  *  1.1,  and  ©„..  as  in  (144a), 

r  X 

P  =  2.5866 
Z  =  2.5866 


Then,  Equation  (99)  reduces  to: 


22.3814  fl  -  95.4886  fQ  +  206.5586 
MMVV  (4.1732  -  fj  (6.6905  -  4.1732  fa) 

A  plot  of  *  JoPT^f9^  over  is  shown  in  Figure  18. 


(149) 


For  f  *  1.3,  and  ©^  in  (144b), 
P  =  2.9401 


E  =  2.9401 


Then,  Equation  (99)  reduces  to: 


37.3610  fg  -  173.4110  f  +  412.4891 
"WV  *  (4.5802  -  fj  (8.6442  -  4.5802  f.) 


(150) 


A  plot  of  JMM(f0)  ~  JoPT^f6^  over  t0*7*1*^  is  shown  in  Figure  19. 
For  f  -  1.5, 


P  -  3.3028 
I  -  3.3028 


Then  Equation  (99)  reduces  to: 

59.4976  fj  -  306.3235  f  +  786.5119 

JMM(f0)  =  (5.1056  -  fj (10.9085  -  5.1056  f.)  (151) 

y  y 

A  plot  of  ^^(fg)  ■  J0PT^fa^  over  [0*5, 1.5]  is  shown  in  Figure  20. 

Figures  18,  19  and  20  show  the  relative  cost  of  the  minimax 
controllers.  Also,  these  figures  show  the  relative  cost  of  the  least 
sensitive  controllers  we  previously  displayed  in  Figures  1,  2  and  3.  From 
these  plots,  we  can  compare  the  performance  of  the  minimax  controllers  with 
the  performance  of  the  least  sensitive  controllers.  Several  observations  can 
be  made. 

1)  Recall  that  the  minimax  controllers  are  optimal  at  the  worst-case  parameter 
value,  whereas  the  least  sensitive  controllers  are  not  optimal  at  any  parameter 
value  in  the  intervals  under  consideration. 

2)  The  least  sensitive  controllers  exhibit  superior  performance  over  a  wider 
range  of  parameter  values  than  the  minimax  controllers  exhibit.  This  property 
can  be  quantified  as  the  percentage  of  full  range  that  a  particular  controller 
exhibits  superior  performance. 


Minimax 


Least 

Sensitive 


v-v 
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v# , 


« 


*£ 


c 


0_  %  range  (least  sensitive)  %  range  (minimax) 

r 


[0.9, 1.1]  77.5%  22.5% 
[0.7. 1.3]  82.3%  17.7% 
[0.5,1.53  84.2%  15.8% 


From  these  figures  we  can  see  that  the  least  sensitive  controllers  exhibit 
superior  performance  for  an  increasing  relative  range  of  parameter  values 
as  less  parameter  sensitivity  is  desired. 

3)  The  relative  difference  between  the  maximum  relative  cost  for  the 
minimax  controllers  and  the  least  sensitive  controllers  decreases  as 
less  sensitivity  is  desired.  The  relative  maxima  as  a  percentage  of 
J^Onax)  -  J0pT(max)  are: 


0_  %  difference 

F 


[0. 9,1.1]  62.5% 
[0.7,1. 3]  40.9% 
[0.5, 1.5]  17.0% 


Note  that,  as  will  be  seen  in  the  second  example,  this  trend  will  continue. 
As  we  widen  the  range  of  parameter  values,  the  maximum  relative  cost  of 
the  minimax  controller  is  actually  less  than  the  maximum  relative  cost  of 
the  least  sensitive  controller. 

These  observations  are  useful  when  considering  which  design  is 
best  for  a  particular  application.  However,  discussion  of  design  considera¬ 
tions  is  postponed  until  after  the  investigation  of  Example  4.2. 

Now  consider  Example  4.2  with  fg  ■  f  ■  1  and  the  parameter  sets 
in  (145).  We  can  use  the  Frdchet  differentials  of  Theorem  2  to  find  the 
maximin  solution  for  each  of  these  sets.  The  result  for  each  set  is  stated 
in  Part  (1)  of  Theorem  3.  The  maximin  solutions  are 


I 


We  now  calculate  gmay  for  each  of  these  sets  using  Equation  (118) 
To  evaluate  (118)  we  need  the  corresponding  P  and  £  for  each  g  in  (152) . 
From  Equations  (146c)  and  (146d)  we  can  evaluate  P  and  £: 


P  ,  L±JL±. 
*  2 
g 

r  =  i  +  Ji 


Using  Equations  (118)  to  evaluate  g  ,  the  results  are: 

@G1  :  8  ’  °'9  <W  ‘  9-5726 

0G2  ;  8  ■  °'7  8ma,  ‘  U-3463 

®03  :  8'°-5  8^-  14-8239 

Therefore,  by  Theorem  3,  Part  (2),  the  sets  ®G2  an(i  enj°y  the 

saddle-point  property,  and  thus  Solutions  (152)  are  minimax  solutions,  and 
the  corresponding  controllers,  (146),  are  the  minimax  controllers. 

It  is  interesting  to  note  that,  for  this  example,  we  enjoy  a 
rather  liberal  choice  of  parameter  sets  in  the  sense  that  as  the  deviation 
from  the  mean  increases,  the  upper  bound  on  the  set  also  increases,  so 
that  we  are  limited  by  the  lower  bound,  g0  *  0,  rather  than  the  upper  bound. 


g„  ■  g  .  Therefore,  for  this  example,  all  the  desired  deviations  from  the 
9  max 

mean ,  up  to  100%,  have  minimax  controllers  that  we  can  design  using  the 


method  described  in  this  section. 


To  evaluate  J^j^gg)*  Equation  (99)  is  used  with  P  and  E  for  each 
g  given  by  Equations  (153a)  and  (153b) . 

For  g  *  0.9,  and  ©G1  in  (145a), 

P  =  2.8955 
E  =  2.4142 

Then  Equation  (99)  reduces  to: 

19.7902  gg  +  127.6514  g0  -  29.6352 
JMM(g0)  "  2.7596  (3.7596  -  6.2913  gfl)  ‘  (1 

A  plot  of  -^(gg)  ~  J0PT^8e^  0Ver  t0. 9, 1.1]  is  shown  in  Figure  21. 

For  g  =  0.7,  and  ©G2  in  (145b), 

P  *  4.5320 
E  *  2.4142 

Then  Equation  (99)  reduces  to: 

29.3287  g0  +  224.6223  gQ  -  43.1607 
JMM(g9)  =  2.6349  (3.6349  -  7.6588  gQ)  ‘  (1 

A  plot  of  ^^(gg)  ~  JoPT^g0^  over  [0.7,1. 3]  is  shown  in  Figure  22. 

For  g  =  0.5,  and  ©G3  in  (145c), 

P  -  8.4721 
E  =  2.4142 


Then  Equation  (99)  reduces  to: 


JMM^ge^ 


(156) 


52.2926  gg  +  539.8936  g0  -  75.6519 
2.5322  (3.5322  -  10.2267  g  ) 

A  plot  of  -^(gg)  -  J0pT(Se)  over  (0.5, 1.5]  is  shown  in  Figure  23. 

Figures  21,  22  and  23  show  the  relative  cost  of  the  minimax 
controllers.  Also  these  figures  show  the  relative  cost  of  the  least  sensitive 
controllers  that  are  displayed  in  Figures  8,  9  and  10.  From  these  plots,, 
similar  observations  can  be  made  as  in  the  previous  example. 

1)  Again,  the  minimax  controllers  are  optimal  at  the  worst-case  parameter 
value,  and  the  least  sensitive  controllers  are  not  optimal  for  any  value. 

2)  The  least  sensitive  controllers  exhibit  superior  performance  over  a 
wider  range  of  parameter  values.  The  percent  of  full  range  that  a  particular 
controller  exhibits  superior  performance  is  shown  below: 

©G  %  range  (least  sensitive)  %  range  (minimax) 

[0.9, 1.1]  78.55%  21.45% 

[0.7, 1.3]  85.93%  14.07% 

[0.5, 1.5]  90.91%  9.09% 

Again  the  least  sensitive  controllers  exhibit  superior  performance  for  an 
increasing  percentage  of  the  full  range  of  parameter  values  as  less  parameter 
sensitivity  is  desired. 

3)  The  relative  difference  between  the  maximum  relative  cost  for  the  minimax 
controllers  and  the  least  sensitive  controllers  decreases  as  less  parameter 
sensitivity  is  desired.  The  relative  maxima  as  a  percentage  of 

JMM(max)  •  JLS(nax) 


are : 


Least 

Sensitive 


%  difference 


[0.9,1.11  57.63% 

[0.7,1.31  3.39% 

[0.5.1.5]  ~  102.86% 

Here  the  results  are  much  more  dramatic  than  in  the  previous  example.  For 
the  set,  [0.5 ,1.5],  the  maximum  relative  cost  for  the  least  sensitive  controller 
is  over  100%  higher  than  the  maximum  relative  cost  of  the  minimax  controller. 

One  cause  for  the  difference  in  the  behavior  of  the  relative 
maxima  of  the  two  examples  may  be  seen  by  referring  back  to  the  plots  of 
Jqpt  for  the  two  examples  given  in  Figures  4  and  11.  As  can  be  seen  from 
these  plots,  the  least  sensitive  controller  must  compensate  for  much  higher 
worst-case  costs  when  gQ  *  0.  is  unknown.  Also  these  worst-case  costs  increase 
more  rapidly  as  the  parameter  set  is  widened  for  the  gQ  *  02  case. 

This  completes  our  observations  for  the  two  examples.  These 
observations  are  useful  when  one  has  to  make  a  choice  between  the  two  designs 
for  a  particular  application.  From  Observations  (2)  and  (3)  of  the  two 
examples,  one  may  detect  a  conflict  in  objectives.  On  the  one  hand,  the 
least  sensitive  controllers  exhibit  superior  performance  over  the  wider  range 
of  parameter  values.  On  the  other  hand,  the  least  sensitive  controllers  can 
exhibit  very  poor  performance  over  the  worst-case  values  of  parameters  when 
the  variance  is  substantial.  If  this  worst-case  performance  is  intolerable, 
one  may  prefer  the  minimax  controllers  over  the  least  sensitive  controllers. 
Therefore,  either  controller  design  may  be  preferable  over  the  other,  depending 


on  the  application. 


4.5.  Conclusions 


In  conclusion,  this  section  has  dealt  with  the  design  of  minimax 
controllers  that  minimize  the  worst-case  cost.  An  equivalence  between  the 
solution  of  the  minimax  problem,  (8) ,  and  the  solution  of  the  maximin  problem, 
(9),  has  been  sought.  Problem  (9)  was  shown  to  have  a  solution  which  could 
be  found  by  setting  the  Frechet  differentials  in  Theorem  2  nonpositive. 

Theorem  2  also  gave  the  requirements  for  the  equivalence  of  the  minimax 
problem, (8),  and  the  maximin  problem,  (9) .  We  noted  that  the  method  of  proof 
of  the  equivalence  of  (8)  and  (9)  when  the  uncertainties  were  in  the  noise 
covariance  given  by  Looze,  Poor,  et  al.  [7]  could  not  be  used  here  since 
that  proof  relied  on  the  creation  of  an  error  model  that  could  not  be  formed 
in  this  case  because  the  uncertainties  were  in  the  dynamics  of  the  state 
model.  Indeed  we  found  that  in  the  scalar  case  there  were  examples  where 
(3)  and  (9)  were  not  equivalent. 

The  scalar  problem  was  then  considered.  The  maximin  solutions  for 
a  class  of  sets,  x>  was  found  by  setting  the  Frdchet  differentials  of  Theorem  2 

3 

nonpositive.  We  found  the  largest  convex  set  in  1R  such  that  (8)  and  (9)  are 
equivalent  by  examining  the  level  surface  of  (99)  at  the  maximin  solution. 

This  surface  is  a  half-hyperboloid  defined  by  (102) .  By  examining  the 

contours  of  (102)  in  the  fQ  -  gQ  plane,  the  fQ  -  hQ  plane  and  the  gQ  -  hQ 

y  y  y  y  y  y 

plane  at  values  of  h.  *  h,  gQ  *  g  and  f„  =  f,  respectively,  we  were  able  to 

y  y  y 

find  all  the  sets  in  x  such  that  (8)  and  (9)  were  equivalent.  This  result 
was  summarized  in  Theorem  3. 


Finally  the  examples  of  Chapter  2  were  considered,  and  Theorem  3 
was  used  to  show  that  the  solutions  to  (8)  and  (9)  were  equivalent  in  these 
examples .  The  performance  of  the  minimax  controllers  was  compared  with 
the  performance  of  the  least  sensitive  controllers  of  Chapter  2.  It  was 
seen  that  depending  on  the  design  objectives  either  controller  may  be 
preferable.  Specifically,  the  least  sensitive  controllers  were  seen  to 
exhibit  superior  performance  over  a  wider  range  of  parameter  values,  but 
the  minimax  controllers  had  lower  maximum  cost  when  the  relative  range  of 
parameter  values  was  substantial. 
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CHAPTER  5 


A  TWO-DIMENSIONAL  SYSTEM 

In  this  chapter,  we  consider  the  design  of  least  sensitive  and 
minimax  controllers  for  a  two-dimensional  system  with  different  uncertainties. 
These  examples  serve  to  present  some  important  ideas  about  the  design 
procedures  and  considerations  for  multivariable  systems  that  were  not 
illustrated  in  the  previous  examples . 

The  design  of  least  sensitive  controllers  is  considered  first.  The 
design  procedure  for  these  examples  is  the  same  as  that  of  Chapter  3.  However, 
with  multivariable  systems,  the  designer  has  the  option  of  reducing  the  order 
of  the  controller  dynamics,  thus  perhaps  saving  on  some  fixed  costs.  Therefore, 
the  analysis  of  these  examples  will  concentrate  on  the  relative  performance  of 
the  reduced-order  controllers  and  the  full- order  controllers. 

Next,  the  design  of  minimax  controllers  is  considered.  We  saw  in 
Chapter  4  that  the  design  procedure  for  minimax  controllers  for  multivariable 
systems  in  a  general  sense  was  not  feasible,  particularly  in  being  able  to 
show  the  equivalence  of  the  minimax  solution  to  the  maximin  solution  through 
verification  of  the  saddle-point  condition  of  Theorem  1,  Chapter  4.  However, 
this  chapter  shows  how  the  design  may  be  carried  out  numerically  for  a  specific 
example.  First,  the  maximin  controller  can  be  found  using  a  plot  of  the 
optimal  cost  for  each  value  of  the  unknown  parameter.  Once  the  maximin 
solution  is  obtained,  we  can  verify  the  saddle-point  condition  by  investigating 
a  plot  of  the  cost  associated  with  the  use  of  the  maximin  controller  for 
each  value  of  the  unknown  parameter. 


Finally  performances  of  the  minimax  and  the  least  sensitive 
controllers  are  compared  under  various  circumstances ,  and  the  considerations 


one  should  make  in  choosing  a  particular  design  are  discussed. 


5.1.  Formulation  of  Examples 

We  consider  a  two-dimensional  system  in  canonical  form  with  four 
possible  unknown  parameters : 


(157, 


(157 


The  nominal  parameter  vector  is  (©^  ©2  ©3  9^)"  “(1-111)".  We  consider 
four  examples,  each  with  only  one  parameter  unknown.  In  each  example,  we 
consider  intervals  with  end  points  that  are  10%,  30%  and  50%,  off  the 
nominal  value. 


Example  5.1 
Example  5.2 
Example  5.3 
Example  5.4 


©^  unknown,  0£  ■  -  1,  ©^  =  1,  ©^ 
01  *  1,  ©2  unknown,  ©^  ■  1,  ©^  * 
©1  =  1,  ©2  =  -  1,  ©^  unknown,  ©^ 
©1  ”  i,  ©2  *  '  1’  e3  = 


*  1 


1 

=  1 


,  »* ,v 


■.A  _  . 


/- l*- ;•  .*• i. 

A.*,.'-.  fa.  m  ilil  His 


“v% 


1 ,  0  ^  unknown 


K 


Least  Sensitive  Controllers 


The  gradient  algorithm  developed  in  Chapter  3  and  given  in  the 
Appendix  can  be  used  to  find  first  and  second-order  least  sensitive 
controllers  for  these  examples.  However,  an  adequate  starting  point  must  be 
fo-~ud  in  order  to  use  the  algorithm. 

As  a  starting  point  for  second-order  controllers,  we  use  the  optimal 
controller  for  the  nominal  system  in  the  block  diagonal  coordinates,  (10) .  The 


nominal  system  is: 


.:)  ■  0 


dy  *  (1  0) 


udt  + 


dt  +  dv 


(158a) 


(158b) 


The  optimal  controller  for  this  system  is  [3]: 


where 


0  1 


1  -1 


and  P 


P  P 
1  2 


u  =  —  (0  l)Px 


0  0\  fl  0\  / 1  0\ 

IP  -  Z[  )  xdt  +  l(  I dy 

o  i  J  Vo  o/  Vo  0/ 


satisfies : 


(159) 


(160) 


E 


(162) 

(163a) 

(163b) 

(163c) 

(164) 

(165a) 

(165b) 

(165c) 


With  P  and  £  given  in  (164)  and  (166) ,  Equations  (159)  and  (160)  reduce  to 


u  -  -  (2.4142  1.6131)x  I 

(-1.732  1  \  /  1.732l\ 

lidt  +  (  )  dy  ( 

-2.4142  -2.6131/  V  1  / 

In  the  block  diagonal  coordinatesi (10) ,  the  optimal  controller  is  of  the 


u  *  -  (0  l)z 


0  1 


P1  P2, 


zdt  +  I  Idy 


We  match  the  Y(s)  to  U(s)  transfer  functions  of  these  two  controllers. 


controllers  represented  by  (167)  and  (168) , 


-  -  (2.4142  1.6131) 

Y(s) 


s  +  1.7321 


-1  '\Vl.7321 


2.4142  s  +  2.6131 


-  (2.4142 


1.6131)  /s  +  2.6131  1  \  /1.7321  \ 

'  -  2.4142  s  +  1.7321'  '  1  / 


s  +  4.3452  s  +  6.9403 


5.7946  s  +  9.3897)s 


s  +  4-3452  s  +  6-9403 


For  controllers  represented  by  (169)  and  (170) 


(172) 


U(s) 

Y(s) 


(p4s  +  PiP3^s 


2 

s  -  P2S  ~  Px 


Comparison  of  (171)  and  (172)  gives  the  desired  starting  point  for  the 
gradient  algorithm: 


Pj.  ■  -  6.9403 
P2  *  -  4.3452 
P3  -  1.3529 
P4  -  -  5.7947 


(173) 


To  find  a  starting  point  for  first-order  controllers,  we  find  the  Y(s)  to  U(s) 
transfer  function  for  the  first-order  controller  and  attempt  to  match  this 
transfer  function  to  (172) .  The  first-order  controller  is 


u  =  z 

dz  =  p^zdt  +  p2dy  • 

Then, 

M  .  .  _2.1 

Y(s)  s  -  px 


(174) 

(175) 


(176) 


To  facilitate  a  comparison  between  (176)  and  (172),  we  attempt  to  cancel  the 
zero  in  (172) 


.V 


We  compare  (176)  to  (177)  and  set 


P2  -  p4  -  -  4.1743 


P1P3 

p  =  p  + 

P1  P2  p. 


-  5.7947 


(178) 


where  we  have  neglected  the  remainder  in  the  denominator  of  Equation  (177) . 
However,  this  remainder  is  not  negligible,  so  we  seek  an  improvement  in  this 
starting  point.  We  can  use  the  gradient  algorithm  to  make  this  improvement 
by  noting  that  choosing  a  degenerate  single  point  distribution  corresponds 
to  the  case  where  the  parameter  value  is  known.  Therefore,  by  using  (178) 
as  a  starting  point  for  the  gradient  algorithm  applied  to  the  nominal  system, 
a  more  exact  starting  point  is  found.  This  point  is 

P1  =  -  5.9235 

P2  =  -8.3097  .  (179) 

Note  that  these  are  the  control  parameters  corresponding  to  the  optimal 
first-order  controller  for  the  nominal  system.  We  use  these  parameters  as 
the  starting  point  for  determining  first-order  controllers  using  the  gradient 


algorithm. 


We  now  consider  Che  design  and  analysis  of  least  sensitive 
controllers  for  Examples  5.1  through  5.4,  for  various  intervals  spread 
evenly  about  the  nominal  values.  As  in  Chapter  3,  uniform  distributions 
are  induced  on  these  intervals.  The  same  type  of  approximating  distributions 
is  found  using  Equations  (34),  (35a),  (39)  and  (41)  of  Chapter  3,  choosing 
N=3.  We  can  use  this  low  choice  of  N  since  the  corresponding  controllers 
have  similar  performances  as  controllers  designed  for  N  large  have. 

Therefore,  we  can  save  on  computations  without  losing  much  information 
pertinent  to  our  analysis . 

Again  we  choose  as  a  performance  measure  a  plot  of  the  difference 
between  the  cost,  incurred  by  the  least  sensitive  controller,  and 

the  optimal  cost,  J0PT^9^ ’  for  eacil  value  the  parameter  over  the  range 
of  uncertainty.  This  is  a  plot  of  the  additional  cost  incurred  by  the  least 
sensitive  controller  over  the  minimum  cost  if  the  parameter  were  known. 

Now  consider  Example  5.1: 

unknown,  02  =  -  1,  =  1,  8^  =  1 

with  the  uniform  distribution  induced  on  the  10%,  30%  and  50%  spreads: 

9X  -  U[0.9,l.l] 

91  -  U£0.7,1.3]  (180) 

0X  -  U[0.5,1.5] 

The  gradient  algorithm  is  used  to  calculate  first  and  second-order  least 
sensitive  controllers  for  these  distributions ,  along  with  the  corresponding 
average  cost.  The  results  are: 


Distribution 

Controller 

Order  p  ^ 

P  2 

?  3 

P  4 

JAVG 

U[0. 9,1.1] 

1 

-5.9266 

-8.4104 

28.6159 

2 

-7.2769 

-4.3981 

1.3724 

-5.8667 

27.2015 

U[0. 7,1.3] 

1 

-5.9822 

-9.100 

31.4066 

2 

-9.1375 

-4.7185 

1.4852 

-6.3612 

30.0375 

UfO.5,1.5] 

1 

-6.1337 

-10.1914 

36.1568 

2 

-11.3105 

-5.1174 

1.6304 

-7.1322 

34.7152 

The  plots  of  1^(9)  “  ^oPT^  f°r  ^irst  an<*  secon<i“order 
controllers  are  shown  in  Figures  24,  25  and  26  for  each  distribution.  Several 
observations  can  be  made  from  these  curves. 

1)  The  performance  of  the  second-order  controllers  exhibits  most  of  the 
properties  observed  in  Chapter  3,  which  are  related  to  the  basic  shape  of  the 
performance  curve.  Since  the  first-order  controllers  have  the  same  basic 
shape,  they  also  exhibit  these  properties.  These  properties  include  the  fact 
that  less  performance  is  sacrificed  at  near  nominal  values  than  at  off  nominal 
values,  and  that  the  controllers  are  biased  towards  parameter  values  associated 
with  higher  costs. 

2)  The  savings  in  using  a  second-order  controller  over  a  first-order  controller 
relative  to  the  total  average  cost  of  the  first-order  controller  is  significant, 
but  relatively  low.  We  can  quantify  this  by  looking  at  the  percent  decrease 

in  the  average  cost  of  the  second-order  controllers  relative  to  the  cost  of 
the  first-order  controllers. 


Distribution 


U[0.9,l.l] 

U[0 . 7 ,1. 3] 

U[0.5 ,1 . 5 ] 

3)  Since  we  are  obligated  to  incur  at 


Decrease  in 

5.2% 

4.6% 

4.2% 

least  the  optimal  cost  of  each  value  of 


the  parameter,  there  is  a  maximum  amount  one  can  save  over  the  average  cost  of 


Figure  26.  J^g  -  vs.  8^  for  1st  and  2nd-order  controllers, 

9.  -  U[0.5 ,1 . 5 ] ;  and  -  JATJ(T,  vs.  8,  ,  9,  €  [0.5,1.51 
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the  first-order  controllers.  Figures  24,  25  and  26  show  that  large  percentages 
of  the  maximum  possible  savings  are  realized  by  the  second-order  controllers. 

For  each  distribution,  we  find  the  percentage  of  the  maximum  possible  savings 
over  the  first-order  costs  that  the  second-order  controllers  realize. 

Distribution  Percent  of  Maximum  Savings 

U[0.9,l.l]  81.9% 

U[0.7,1.3]  34.6% 

U[0.5,1.5]  21.1% 

Note  that  this  percentage  decreases  as  the  uncertainty  increases  so  that  the 
two  controllers  are  more  comparable  as  less  sensitivity  is  desired. 

4)  From  Figures  24,  25  and  26  we  can  see  that  the  difference  between  the 
first  and  second-order  controllers  is  greater  for  near  nominal  values  of  the 
parameter  and  less  for  off  nominal  values.  Therefore,  more  performance  is 
sacrificed  at  near  nominal  values  than  at  off  nominal  values  when  the  first- 
order  controllers  are  used  instead  of  second-order  controllers. 

These  observations  show  that  the  first-order  controllers  have  fairly 
good  performance  relative  to  the  total  cost  incurred  by  each  type  of  controller, 
but  that  the  second-order  controllers  can  realiSe  a  large  amount  of  the  total 
possible  improvement  in  performance.  Also,  the  first  and  second-order 
controllers  become  more  comparable  as  the  range  of  uncertainty  increases. 

We  will  discuss  what  these  observations  mean  when  one  is  considering  a  specific 
design  after  the  remaining  examples  are  analyzed. 

Before  proceeding  to  the  next  example,  note  that  Observations  (1) 
and  (4)  above  will  generally  hold  for  the  remaining  examples.  Therefore,  we 
will  concentrate  on  observations  similar  to  (2)  and  (3)  in  the  discussions  of 


the  remaining  examples. 


Next,  consider  Example  5.2: 


@1  =  1,  02  unknown,  0^  ■  1,  0^  *  1 


with  the  uniform  distribution  indiced  on  10%,  30%  and  50%  spreads: 


02  -  U[-l.l,-0.9] 

02  -  Ul-1.3,-0.7]  (181) 

©2  ~  U[-1.5,-0.5] 


The  gradient  algorithm  is  used  to  calculate  first  and  second-order  least 
sensitive  controllers  for  these  distributions,  along  with  the  corresponding 
average  cost.  The  results  are: 


Distribution 

Controller 

Order  p 

P2 

p3 

p4 

JAVG 

U[ -1.1, -0.9] 

1 

-5.9628 

-8.3610 

28.3333 

2 

-6.9373 

-4.3413 

1.3521 

-5.8081 

26.8616 

U[ -1.3, -0.7] 

1 

-6.2947 

-8.7943 

29.2138 

2 

-6.9181 

-4.3118 

1.3452 

-5.9166 

27.4057 

U[-1.5,-0.5] 

1 

-7.0611 

-9.7934 

31.2339 

2 

-6.9046 

-4.2621 

1.3048 

-6.1376 

28.5435 

The  plots  of  J  _(0)  -  J  (0)  for  first  and  second-order  controllers 
Lb  OrT 

are  shown  in  Figures  27,  28  and  29  for  each  distribution.  Two  similar 
observations  to  those  of  Example  5.1  will  be  made  here. 

1)  We  again  calculate  the  decrease  in  average  cost  of  the  second-order 
controllers  relative  to  the  average  cost  of  the  first-order  controllers. 


Distribution 


Decrease  in  J 


AVG 


U[ -1.1, -0.9] 
OT-1.3,-0.7] 
U[-l . 5  ,-0. 5 ] 


5.2% 

8.1% 

8.6% 


Note  that  as  in  the  previous  example,  the  savings  are  significant,  but 
relatively  low. 

2)  As  in  the  previous  example  we  calculate  the  percentage  of  the  maximum 
savings  over  the  first-order  controller  that  the  second-order  controller 
realizes. 


Distribution  Percentage  of  Maximum  Savings 


U[-l.l,-0.9] 
U[-1.3,-0.7] 
U[-1.5 ,-0.5] 


98.3% 

88.7% 

80.4% 


Note  that  these  results  are  much  more  dramatic  than  those  in  Example  5.1, 
with  only  a  slight  decrease  in  savings  as  more  parameter  insensitivity  is 
desired. 

Thus  our  observations  in  this  example  are  similar  to  Example  5.1 
except  that  the  improvement  of  second-order  controllers  in  this  example  is 
generally  greater.  This  greater  improvement  may  be  a  result  of  the  fact  that 
control  effort  for  ©3  has  more  effect  on  the  stable  subspace  of  the  states 
and  thus  the  performance  is  more  responsive  to  the  greater  control  effort  of 
the  second-order  controllers . 

Next,  we  consider  Example  5.3: 


©1=1,  ©2  =  -  1,  ©^  unknown,  9^=1 


with  the  uniform  distribution  induced  on  10%,  30%  and  50%  spreads: 


©3  -  U[0.9,l.l] 


©3  ~  U [0 . 7 ,1 . 3] 


(182) 


©3  -  U[0.5 ,1.5] 


i 


J 


C.. 


The  gradient  algorithm  is  used  to  calculate  first  and  second-order  least 
sensitive  controllers  for  these  distributions,  along  with  the  corresponding 
average  cost.  The  results  are: 


Distribution 

Controller 

Order  p^ 

p2 

p3 

p4 

JAVG 

U[0.9,l.l] 

1 

-5.9157 

-8.4212 

28.7810 

2 

-7.5486 

-4.4366 

1.3792 

-5.8814 

22.4062 

U[0.7 ,1.3] 

1 

-5.9351 

-9.3592 

33.5228 

2 

-12.2170 

-5.2161 

1.5533 

-6.6179 

32.4224 

Ut0.5,1.5] 

1 

-6.2351 

-11.6023 

45.3055 

2 

-22.2779 

-6.9242 

1.8494 

-8.3323 

44.3532 

The  plots  of  JLS(9)  ”  J0PT^  f or  first  and  second-order  controllers 
are  shown  in  Figures  30,  31  and  32  for  each  distribution.  Again  we  make  the 
same  types  of  observations. 

1)  We  calculate  the  decrease  in  average  cost  of  the  second-order  controllers 
relative  to  the  average  cost  of  the  first-order  controllers. 


Distribution 


Decrease  in  JAVG 


U[0.9,l.l]  4.8% 
U[0.7 ,1.3]  3.3% 
U[0.5 ,1.5]  2.1% 


Again,  the  savings  is  significant,  but  relatively  low. 

2)  We  calculate  the  percentage  of  the  maximum  possible  savings  over  the 
first-order  controllers  that  the  second-order  controllers  realize. 


Distribution  Percent  of  Maximum  Savings 

U[0. 9,1.1]  75.9% 

U[0 . 7 ,1. 3]  18.1% 

U[0.5,1.5]  3.3% 


Note  that  as  the  variance  increases  the  relative  savings  that  the  second- order 
controller  realizes  drops  to  almost  nothing.  Indeed,  we  can  see  from  Figure  32 


~  JqPT  vs.  63  f°r  1st  and  2nd-order  controllers, 
U[0.9,l.l] ;  and  -  JnI5_  vs.  9,,  e,e  [0.9,1.11 
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that  the  performances  of  the  first  and  second-order  controllers  are  almost 
identical  for  the  U[0.5,1.5]  distribution. 

We  will  see  that  the  results  for  this  example  are  similar  to  those 
of  Example  5.4.  We  now  consider  Example  5.4: 

@1  =  1,  02  =  “  =  unknown 

with  the  uniform  distribution  induced  on  10%,  30%  and  50%  spreads: 

04  -  U[0.9,l.l] 

04 ~  U[0.7,1.3]  (183) 

04  -  U[0.5,1.5] 

The  gradient  algorithm  is  used  to  calculate  first  and  second-order  least 
sensitive  controllers  for  these  distributions,  along  with  the  corresponding 
average  cost.  The  results  are: 


Distribution 

Controller 

Order  p  ^ 

P  2 

P3 

P4 

JAVG 

U[0. 9,1.1] 

1 

-5.9154 

-8.U29 

28.7238 

2 

-7.5431 

-4.4347 

1.3778 

-5.8746 

27.3502 

U[0.7,1.3] 

1 

-5.9273 

-9.2932 

33.0063 

2 

-12.2456 

-5.2146 

1.5449 

-6.5604 

31.9225 

U[0 . 5 ,1.5 ] 

1 

-6.2092 

-11.4348 

43.7246 

2 

-22.4724 

-6.9572 

1.8310 

-8.2123 

42.8125 

The  plots  of  “  ^OPT^  ^°r  ^irst  anc*  second-order 

controllers  are  shown  in  Figures  33,  34  and  35  for  each  distribution. 

Again,  we  make  the  same  types  of  observations. 

1)  We  calculate  the  decrease  in  average  cost  of  the  second-order  controllers 
relative  to  the  average  cost  of  the  first-order  controllers. 


ill 


Distribution 


Decrease  in  J 


AVG 

U[0.9,l.l]  4.8% 

U[0.7,1.3]  3.3% 

U[0.5,1.5]  2.1% 

Again,  the  savings  is  significant,  but  relatively  low. 

2)  We  calculate  the  percentage  of  the  maximum  possible  savings  over  the 
first-order  controllers  that  the  second-order  controllers  realize. 

Distribution  Percent  of  Maximum  Savings 

U[0.9,l.l]  76.8% 

U[0 .7,1.3]  15.2% 

U[0.5,1.5]  4.2 % 

Note  that  as  in  Example  5 . 3  the  relative  savings  realized  by  the  second-order 
controller  drops  to  almost  nothing,  and  we  can  see  from  Figure  35  that  the 
performances  of  the  first  and  second-order  controllers  are  almost  identical. 

Examples  5.3  and  5.4  lead  to  very  similar  results.  This  is  true 
for  several  reasons.  The  canonical  form  of  the  system  and  the  use  of  identical 
spreads  around  the  same  nominal  value  make  the  role  of  9^  very  similar  to  0^ 
in  the  transfer  of  the  observation  information  to  the  control  effort. 

From  these  four  examples  we  have  discovered  that  the  use  of  the 
second-order  controllers  can  lead  to  large  savings  over  the  use  of  first- 
order  controllers,  but  only  in  terms  of  the  total  savings  possible.  The 
losses  incurred  by  using  the  first-order  controllers  relative  to  the  actual 
coverage  cost,  however,  is  less  significant.  Also  the  two  controllers  become 
more  similar  as  the  variance  increases,  as  in  Examples  5.3  and  5.4  where  they 


are  seen  to  be  almost  identical  in  one  case. 


These  observations  are  useful  when  one  is  considering  which  order 
to  choose  in  the  design  process.  Basically,  if  the  fixed  costs  associated 
with  increasing  the  controller  dimension  are  greater  than  the  resulting 
savings  in  operating  costs,  one  should  probably  want  to  choose  the  first-order 
controller.  Therefore,  for  higher  dimensional  systems,  the  controller  used 
should  be  that  which  balances  the  fixed  cost  of  increasing  the  controller 
order  with  the  savings  in  operating  costs  that  the  increase  realizes.  This 
concludes  our  consideration  of  the  design  of  least  sensitive  controllers  for 
these  examples. 

5.3.  Minimax  Controllers 

In  this  section  we  seek  to  find  minimax  controllers  for  these 
examples.  In  Chapter  4  we  saw  that  the  solution  to  the  minimax  problem,  (8),  is 
equivalent  to  the  solution  to  the  maximin  problem, (9) ,  if  the  saddle-point 
condition, (56) , of  Theorem  1  holds.  We  saw  that  this  condition  is  equivalent 
to  showing  that 

JMM<9)  ;  JMM(V  (184> 

where  3^(9)  is  the  maximin  cost  over  the  parameter  set  and  9q  is  the  maximin 
solution  point.  In  Chapter  4  we  saw  that  it  is  difficult  to  find  general 
conditions  that  guarantee  the  satisfaction  of  (184) .  In  this  section,  we 
illustrate  how  numerical  procedures  can  be  used  to  design  maximin  controllers 
and  to  verify  (184)  when  the  particular  example  is  already  defined. 

The  maximin  solution  occurs  at  the  value  of  the  unknown  parameter 
that  maximizes  the  optimal  cost,  Jne_(0) .  Therefore,  the  maximin  solution  can 


be  found  by  inspecting  a  plot  of  over  the  parameter  range.  Once  the 

maximin  solution  is  found,  and  since  we  can  numerically  check  that  S  in  (50a) 
is  stable  in  these  examples,  condition  (184)  may  be  proved  by  inspecting  a 
plot  of  over  the  parameter  range. 

We  use  this  numerical  method  of  design  in  Examples  5.1  through  5.4 
and  compare  the  performances  of  the  resulting  controllers  to  those  of  the  least 
sensitive  controllers. 

We  start  with  Example  5.1: 

9^  unknown,  =  -  1,  9^  =  1»  =  1 

with  10%,  30%  and  50%  spreads: 

9^  [0.9, 1.1] 

9X€  [0.7, 1.3] 

9^  [0.5, 1.5] 

A  plot  of  dQpT(9^)  for  9^£  [0.2, 1.8]  is  shown  in  Figure  36. 
we  see  that  the  maxima  over  intervals  (185)  occur  at 


(185) 


From  this  plot 


9 


10 


1.1 


9 


10 


1.3 


(186) 


9 


10 


1.5 


We  can  use  the  gradient  algorithm  to  calculate  the  corresponding  maximin 
controllers  in  the  block  diagonal  coordinates.  The  results  are: 


Interval 

P1 

P2 

P3 

P4 

[0.9, 1.1] 

-7.3819 

-4.4681 

1.4608 

-6.5788 

[0.7, 1.3] 

-8.3046 

-4.7127 

1.6794 

-7.9796 

[0.5, 1.5] 

-9.2709 

-4.9540 

1.9009 

-9.6529 

27.828 

32.853 

39.970 


A  plot  of  J»-w(9i)  is  shown  in  Figure  37  for  all  three  intervals.  From  these 
MM  1 

curves  we  can  see  that 


WV  ;  -WV 


for  each  of  the  intervals  (185)  and  corresponding  maximin  points  (186) . 
Therefore,  these  are  minimax  controllers  also. 

Figures  24,  25  and  26  show  the  plots  of  Jv_,(6n)  -  JrtTl™(0-,)  that  we 

MM  1  UJrT  1 

use  as  our  typical  performance  measure.  Note  we  can  make  similar  observations 
as  we  did  in  the  case  of  least  sensitive  controllers. 

1)  We  calculate  the  decrease  in  average  cost  of  the  minimax  controllers 
relative  to  the  average  cost  of  the  fi.st-order  controllers. 


Interval  Decrease  in  J.„_  (Minimax)  Decrease  in  J.„„  (2nd  Order) 

AVG  AVG 

[0. 9,1.1]  2.8%  5.2% 

[0.7, 1.3]  -4.6%  4.6% 

[0.5, 1.5]  -10.5%  4.2 % 


Note  that  only  for  the  [0.9, 1.1]  case  are  the  minimax  controllers  better  than 
the  first-order  least  sensitive  controllers,  and  the  minimax  controllers  are 
never  better  than  the  second-order  least  sensitive  controllers. 

2)  We  calculate  the  percentage  of  the  maximin  possible  savings  over  the 
first-order  controllers  that  the  rainimax  controllers  realize. 


Interval  %  of  Max.  Savings  (Minimax)  %  of  Max.  Savings  (2nd  Order) 

[0. 9,1.1]  45.8%  81.9% 

[0.7, 1.3]  -36.8%  34.6% 

[0.5,1.53  -49.5%  21.1% 


Again  we  see  that  only  in  the  [0.9, 1.1]  case  are  the  minimax  controllers 
better  than  the  first-order  controllers,  and  the  minimax  controllers  are 


never  better  than  the  second-order  controllers. 


Figure  37.  JwwCQi)  vs.  9.  for  various  maximin  controllers 


In  this  example  we  see  that  the  performances  of  the  minimax  controllers 
are  again  generally  inferior  to  the  performances  of  least  sensitive  controllers, 
even  with  order  reduction.  We  see  that  this  is  also  true  in  the  other 
examples. 

Next  we  consider  Example  5.2: 

0^  =  1,  @2  unknown,  0^  ■  1,  0^  *  1 

with  10%,  30%  and  50%  spreads: 

02e  [-1.1, -0.9] 

02e  [-1.3, -0.7] 

02e  [-1.5, -0.5] 

A  plot  of  Jopt^92^  for  92€  -0.2]  is  shown  in  Figure  38. 

we  see  that  the  maxima  over  intervals  (187)  occur  at 


(187) 


From  this  plot 


0 


20 


-0.9 


0 


20 


-0.7 


(188) 


0 


20 


-0.5 


e 


We  can  use  the  gradient  algorithm  to  calculate  the  corresponding  maximin 
controllers  in  the  block  diagonal  coordinates.  The  results  are: 


Interval 

P1 

P2 

P3 

P4 

JAVG 

[-1.1, -0.9] 

-7.0685 

-4.3528 

1.3354 

-6.0950 

26.927 

[-1. 3,-0. 7] 

-7.3982 

-4.3916 

1.3016 

-6.8252 

27.945 

[-1.5, -0.5] 

-7.8451 

-4.4654 

1.2704 

-7.7766 

29.976 

A  plot  of  Jww(90)  is  shown  in  Figure  39  for  all  three  intervals.  From  these 
MM  L 


curves  we  can  see  chat 


for  each  of  the  intervals  (187)  and  corresponding  maximin  points  (188). 
Therefore,  these  are  minimax  controllers  also. 


Figures  27,  28  and  29,  show  the  performance  of  the  minimax 
controllers.  We  again  make  the  same  observations. 

1)  We  calculate  the  decrease  in  average  cost  of  the  minimax  controllers 
relative  to  the  average  cost  of  the  first-order  controllers. 


Interval  Decrease  in  J.„_  (Minimax)  Decrease  in  J.„„  (2nd-0rder) 

AVG  AVG 

[-1.1, -0.9]  5.0%  5.2% 

[-1.3, -0.7]  4.3%  8.1% 

[-1.5, -0.5]  4.0%  8.6% 


Note  here  that  in  all  three  cases  the  minimax  controllers  are  better  than  the 
first-order  least  sensitive  controllers,  but  are  not  as  good  as  the  second- 
order  controllers. 

2)  We  calculate  the  percentage  of  the  maximin  possible  savings  over  the  first 
order  controllers  that  the  minimax  controllers  realize. 


Interval  %  of  Max.  Savings  (Minimax)  %  of  Max.  Savings  (2nd -Order) 

[-1.1. -0.9]  94.0%  98.3% 

[-1.3, -0.7]  63.2%  88.7% 

[-1.5, -0.5]  42.0%  80.4% 


Again,  we  see  that  the  minimax  controller  is  better  than  the  first-order 
controllers,  but  not  as  good  as  the  second-order  controllers. 

In  Examples  5.1  and  5.2  we  saw  that  the  maximin  controllers  were 
indeed  equivalent  to  the  minimax  controllers.  In  the  next  two  examples  we  see 


cases  where  this  is  not  true. 


Consider  Example  5.3: 


9^  ■  1,  Sj  ■  -  1,  6^  unknown,  0^  =  1 


with  10%,  30%  and  50%  spreads: 


e3e  [0.9, l.i] 

03<=  [0.7, 1.3]  (189) 

@3 €  (0.5, 1.5] 


A  plot  of  J0PT^93^  ^°r  93G  [0*2, 1.8]  is  shown  in  Figure  40.  From  this  plot 
we  see  that  the  maxima  over  intervals  (189)  occur  at 


0 


30 


0.9 


9 


30 


0.7 


9 


30 


0.5 


(190) 


We  can  use  the  gradient  algorithm  to  calculate  the  corresponding  max imin 
controllers  in  the  block  diagonal  coordinates.  The  results  are: 


Interval 

P1 

P2 

P3 

P4 

JAVG 

[0.9, 1.1] 

-6.7615 

-4.2817 

1.4940 

-6.2355 

28.420 

[0.7, 1.3] 

-6.4389 

-4.1675 

1.8976 

-7.5453 

41.525 

[0.5, 1.5] 

-6.1749 

-4.0743 

2.6272 

-10.0216 

107.885 

A  plot  of  J, (9.)  is  shown  in  Figure  41  for  all  three  intervals.  From  these 
MM  3 

curves  we  can  see  that 


JMM(V  -  JMM(930^ 


only  for  the  interval,  [0.9, 1.1].  Therefore,  the  maximin  solution  corresponds 
to  the  minimax  solution  only  for  this  interval.  We  can  see  from  the  plots  of 


Figure  41 


lit 


vs.  9.  for  various  maximin  controllers 


the  performance  of  these  controllers  in  Figures  30,  31  and  32  that  the 
performance  of  the  controllers  is  not  good,  although  there  are  no  marked 
dissimilarities  from  cases  where  the  maximin  controllers  are  equivalent  to 
the  minimax  controllers.  Again,  we  make  our  observations. 

1)  We  calculate  the  decrease  in  average  cost  of  the  maximin  controllers 
relative  to  the  average  cost  of  the  first-order  controllers. 

Interval  Decrease  in  (Maximin)  Decrease  in  (2nd-0rder) 

[0.9.1.1]  1.3%  4.8% 

[0.7,1.31  -23.8%  3.3% 

[0.5,1.5]  -138.2%  2.1% 

As  can  be  seen,  the  performance  of  the  two  maximin  controllers  that  do  not 
exhibit  the  saddle  point  property  compare  very  poorly. 

2)  We  calculate  the  percentage  of  the  maximin  possible  savings  over  the  first 
order  controllers  that  the  maximin  controllers  realize. 

Interval  %  of  Max.  Savings  (Maximin)  %  of  Max.  Savings  (2nd-0rder) 

[0.9,1.11  19.8%  75.9% 

[0.7,1.31  -147.1%  18.1% 

[0.5,1.51  -157.5%  3.3% 

Again,  the  poor  performance  of  the  maximin  controllers  that  do  not  exhibit 
the  saddle-point  property  is  evident . 

We  should  expect  to  find  a  similar  situation  in  Example  5.4  as  in 
Example  5.3,  and  indeed  we  do.  Consider  Example  5.4: 

01=1,  02  *  -  1,  0^=1,  0^  unknown 


with  10%,  30%  and  50%  spreads: 


94e  {0.9, 1.1] 


(191) 


04€  [0.7, 1.3] 
04e  [0.5, 1.5] 


A  plot  of  JqpT^0^  ^or  04€  [0«2,1.8]  is  shown  in  Figure  42.  From  this  plot 
we  see  that  the  maxima  over  intervals  (191)  occur  at 

\o  -  0.9 

\0  ‘  °-7  <192) 
°40  '  0-5 

We  can  use  the  gradient  algorithm  to  calculate  the  corresponding  maximin 
controllers  in  the  block  diagonal  coordinates .  The  results  are : 


Interval 

P1 

P2 

P3 

P4 

JAVG 

[0.9, 1.1] 

-6.7020 

-4.2678 

1.4906 

-6.1664 

23.322 

[0.7, 1.3] 

-6.2585 

-4.1239 

1.8828 

-7.2761 

40.640 

[0.5, 1.5] 

-5.8769 

-4.0005 

2.5894 

-9.4004 

85.132 

A  plot  of  Is  shown  in  Figure  43  for  all  three  intervals.  From  these 

curves  we  can  «ee  that 


JMM(V  -  JMM(940) 


only  for  the  interval,  [0.9,1.!].  Therefore,  as  in  Example  5.3,  the  other 


-  A 


two  controllers  are  not  minimax  controllers,  and  they  exhibit  the  same  type 
of  poor  performance.  Again,  we  make  the  observations. 

1)  We  calculate  the  decrease  in  average  cost  of  the  maximin  controllers 
relative  to  the  average  cost  of  the  first-order  controllers. 


■  ^ 


X 


Interval  Decrease  in  J.„_  (Maximin)  Decrease  in  J.„„  (2nd-0rder) 

AVG  AVG 

[0.9, 1.1]  1.4%  4.8% 

[0. 7,1.3]  -23.1%  3.3% 

[0.5, 1.5]  -94.7%  2.1% 


2)  We  calculate  the  percentage  of  the  maximum  possible  savings  over  the  first 
order  controllers  that  the  maximin  controllers  realize. 


Interval  %  of  Max.  Savings  (Maximin)  %  of  Max.  Savings  (2nd-0rder) 

[0. 9,1.1]  22.3%  76.8% 

[0.7, 1.3]  -149.4%  15.2% 

[0.5, 1.5]  -56.9%  4.2% 

As  in  Example  5.3,  the  maximin  controllers  that  do  not  satisfy  the  saddle- 
point  condition  exhibit  poor  performance. 

We  have  seen  from  these  examples  that  although  the  minimax  (or 
maximin)  controllers  are  not  better  than  the  second-order  least  sensitive 
controllers,  there  are  many  cases  where  they  are  superior  to  the  first-order 
least  sensitive  controllers,  particularly  when  the  intervals  are  relatively 
small. 

Note  that  in  few  cases  is  the  maximum  cost  over  the  interval  greater 
for  the  minimax  controllers  than  for  the  least  sensitive  controllers  (see 
Figure  27  for  an  exception) ,  so  that  this  design  consideration  is  not  as  much 
of  an  issue  in  these  examples. 


5.4.  Conclusions 

In  conclusion,  the  design  of  least  sensitive  and  minimax  controllers 


for  a  second-order  system  was  considered  in  this  chapter.  The  dimension  of 
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the  system  allowed  for  the  investigation  of  the  effects  of  reduced-order 
least  sensitive  controllers,  and  also  provided  an  illustration  of  a  numerical 
design  procedure  for  minimax  controllers. 

In  the  case  of  the  design  of  least  sensitive  controllers,  it  was 
seen  that  some  performance  is  sacrificed  for  order  reduction.  Furthermore, 
the  sacrifice  in  performance  is  on  the  average  a  small  percentage  of  the  total 
average  cost  if  one  used  an  optimal  controller  at  each  value  of  the  parameter. 
However,  a  large  percentage  of  this  additional  cost  can  be  removed  using 
full-order  controllers  in  these  examples.  The  choice  of  controller  order 
was  seen  to  be  a  trade-off  between  the  extra  fixed  costs  for  increasing  the 
order  and  the  lower  operating  costs  this  increase  realizes. 

In  the  case  of  the  design  of  minimax  controllers ,  the  dimension  of 
the  system  provided  an  illustration  of  a  numeric  design  procedure  that  can  be 
performed  if  one  has  a  specific  example.  The  maximin  solutions  were  found 
using  a  plot  of  the  optimal  cost  over  the  range  of  uncertainty.  The  saddle- 
point  condition  was  verified  using  a  plot  of  the  cost  incurred  by  the  maximin 
controller  over  the  range  of  uncertainty.  Some  examples  where  the  saddle  point 
did  not  exist  were  found,  and  the  corresponding  controllers  were  seen  to  exhibit 
poor  performance.  Finally,  ic  was  seen  that  the  minimax  controllers  generally 
have  poorer  performance  than  the  second-order  least  sensitive  controllers,  but 
in  many  cases  have  better  performance  than  the  first-order  least  sensitive 


controllers . 
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CHAPTER  6 


CONCLUSIONS 


This  thesis  has  considered  the  design  of  controllers  for  linear 
stochastic  systems  where  the  corresponding  models  have  uncertain  parameters. 

In  Chapter  1,  we  discussed  several  ways  of  handling  these  uncertainties  in 
the  design  process.  One  method  is  to  induce  a  distribution  on  the  parameter 
set.  Another  method  is  to  allow  for  variations  about  a  nominal  value.  A 
third  method  is  to  assume  that  the  parameter  lies  within  a  given  set,  and 
could  take  values  which  are  unfavorable  to  the  designer. 

The  objectives  for  the  design  of  controllers  for  handling  these 
uncertainties  may  vary.  One  may  only  desire  stable  performance  over  the  whole 
range  of  parameter  values.  One  may  desire  that  the  cost  incurred  for  the 
controller  at  each  value  of  the  parameter  be  less  than  a  certain  value. 

Another  objective  may  be  to  achieve  near  optimal  performance  for  as  wide  a 
range  of  parameter  values  as  possible.  The  two  design  procedures  discussed 
in  Chapters  3  and  4  are  based  on  certain  assumptions  about  the  parameter  sets, 
and  achieve  design  objectives  based  on  these  assumptions. 

In  Chapter  3,  we  assumed  that  the  parameter  set  is  a  compact  set 
centered  about  a  nominal  value.  We  assumed  each  value  equally  likely;  so  we 
induced  a  uniform  distribution  on  the  parameter  set.  Our  design  objective  was 
to  have  as  near  optimal  performance  as  possible  for  the  widest  range  of  off 
nominal  values  of  the  parameter  in  the  set.  Using  the  assumptions  on  the 
parameter  set,  we  were  able  to  formulate  a  problem  where  we  minimized  the 
average  cost  over  the  entire  set. 
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The  resulting  problem  is  a  constrained  optimization  problem. 

Ashkenzai  and  Bryson  [1]  developed  a  method  of  solution  when  the  parameter 
set  has  a  finite  number  of  values,  and  a  discrete  distribution  is  assumed. 

This  procedure  was  extended  here  to  solve  problems  when  the  parameter  set  is 
compact  and  a  continuous  distribution  is  assumed. 

Some  scalar  examples  were  investigated  to  see  what  design  objectives 
were  met.  It  was  found  that: 

1)  Performance  was  best  at  values  of  the  parameter  near  the  nominal  value, 
but  biased  towards  those  values  that  are  associated  with  higher  costs. 

2)  The  performance  generally  worsened  as  the  value  of  the  parameter  under 
consideration  is  further  away  from  the  nominal  value. 

3)  The  performance  is  near  optimal  for  certain  values  of  the  parameter,  but 
not  optimal  at  any  value,  so  that  some  performance  is  always  sacrificed  at  all 
values  of  parameter  in  order  to  achieve  less  sensitivity. 

In  Chapter  4,  we  assumed  that  the  unknown  parameters  were  in  convex, 
compact  sets  centered  around  a  nominal  value.  Our  design  objective  was  to 
find  the  best  controller  for  the  worst-case  model.  Solving  this  problem 
directly  is  infeasible  since  we  cannot  describe  all  models  associated  with 
every  admissible  control.  Another  problem  is  to  find  the  best  control  for 
all  models,  and  then  to  find  the  model  that  exhibits  the  worst  cost  when 
excited  by  its  optimal  controller.  The  two  problems,  described  by  (8)  and  (9), 
are  equivalent  if  a  saddle  point  exists  at  the  solution  point  of  (9) . 

We  outlined  a  procedure  for  solving  the  maximin  problem,  (9) ,  by 
using  the  Frdchet  differentials  of  the  optimal  cost  function,  (48).  Looze, 
Poor,  et  al.  [7]  have  shown  that  a  saddle  point  exists  for  a  general  class  of 


uncertainty  sets  where  the  uncertainty  lies  in  the  noise  covariance.  We  were 
unable  to  establish  the  same  type  of  result  when  the  uncertainty  is  in  the 
system  dynamics  because  the  formulation  of  JCu,0)  in  (5)  became  too  complicated 
However,  a  summary  of  the  conditions  a  problem  must  satisfy  for  a  saddle  point 
to  exist  was  detailed  in  Theorem  2. 

We  then  considered  the  equivalence  of  (8)  and  (9)  for  scalar  systems 
and  parameter  sets  that  are  intervals  on  the  real  line.  We  found  cases  where 
the  saddle  point  does  not  exist,  but  we  were  able  to  describe  all  the  sets 
satisfying  the  assumptions  such  that  a  saddle  point  does  exist. 

The  examples  of  Chapter  3  were  then  investigated  in  Chapter  4  to  see 
what  design  objectives  were  met.  It  was  found  that: 

1)  The  controllers  are  optimal  at  the  maximin  point,  so  that  there  exist  some 
values  of  the  parameter  such  that  the  performance  of  the  minimax  controllers 
is  better  than  the  least  sensitive  controllers. 

2)  The  performance  worsens  as  we  consider  values  of  the  parameter  that  are 
further  away  from  the  maximin  point  to  such  a  degree  that  we  found  the  least 
sensitive  controllers  exhibit  superior  performance  over  most  of  the  range  of 
uncertainty. 

3)  We  found  that  in  some  cases  the  maximum  cost  for  the  minimax  controllers 
is  less  than  the  maximum  cost  for  the  least  sensitive  controllers. 

In  Chapter  5,  we  considered  a  two  dimensional  system  in  orde~  to 
illustrate  some  additional  properties  of  these  two  types  of  controllers. 

First,  the  designs  of  full-order  and  reduced-order  least  sensitive  controllers 
were  considered .  We  saw  that  performance  was  sacrificed  when  the  order  of  the 
controller  was  reduced,  but  this  sacrifice  is  a  small  percentage  of  the  total 
average  cost,  so  that  it  may  be  desirable  to  use  a  reduced  order  controller 


over  the  full  order  controller  if  the  savings  in  the  fixed  costs  are 
comparable  to  the  additional  operating  costs  exhibited  by  the  reduced  order 
controllers. 

Second,  a  numerical  procedure  for  designing  minimax  controllers 
for  specific  examples  was  illustrated.  The  maximin  solution  was  found  by 
investigating  a  plot  of  the  optimal  cost  for  each  value  of  the  parameter.  The 
saddle  point  condition  was  checked  using  a  plot  of  the  cost  incurred  by  the 
maximin  controller  for  each  value  of  parameter  in  the  set.  Using  this 
procedure  we  found  examples  where  the  saddle  point  does  not  exist,  and  we  saw 
that  these  maximin  controllers  exhibited  poor  performance. 

We  compared  the  performances  of  the  minimax  and  maximin  controllers  to 
that  of  the  least  sensitive  controllers  for  these  examples.  It  was  seen  that  in 
many  cases  the  minimax  controllers  exhibited  better  performance  than  the  first- 
order  controllers,  but  were  never  better  than  the  full  order  controllers. 

The  choice  of  minimax  controllers  or  least  sensitive  controllers 
must  be  based  on  one's  initial  design  objectives.  If  we  simply  desire  better 
performance  over  the  widest  range  of  parameter  values,  the  least  sensitive 
controllers  would  be  preferred.  However,  there  are  cases  when  the  maximum 
value  of  the  least  sensitive  controllers  exceeds  that  of  the  minimax  control¬ 
lers,  in  which  case  the  maximin  controllers  may  be  preferred  if  there  is  a 
ceiling  on  the  maximum  cost  that  is  tolerable. 

Reduced-order  least  sensitive  controllers  may  be  preferred  over 
higher  or  full-order  controllers  if  the  saving  in  the  fixed  costs  of  the  order 
reduction  is  comparable  to  the  operating  losses  incurred  by  reducing  the  order. 
One  avenue  for  future  work  may  be  to  systematize  the  decision  process  of  what 
order  is  most  desirable  for  a  certain  class  of  systems. 


There  is  also  a  need  to  obtain  saddle  point  results  for  maximin 
controllers  designed  for  systems  of  higher  dimension.  There  may  be  a  class 
of  parameter  sets  such  that  a  saddle  point  condition  for  each  example 
considered  may  be  relieved  to  some  degree. 

Furthermore,  our  choices  of  design  are  not  necessarily  restricted  to 
the  two  methods  considered  in  this  thesis.  One  idea  is  to  use  a  combination 
of  the  two  designs,  for  instance,  one  can  use  a  least  sensitive  controller 
designed  for  an  interval  that  is  wider  than  the  interval  under  consideration, 
and  thus  can  reduce  the  high  maximum  costs  that  are  prevalent  at  the  end  points 
of  the  intervals.  Of  course,  this  sacrifices  performance  at  other  values  in  the 
interval.  Therefore,  one  should  not  necessarily  restrict  his  choices  of 
controllers  to  the  designs  considered  in  this  paper.  Our  main  interest  is  in 
designing  the  controller  that  best  achieves  our  design  objectives. 
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APPENDIX 


A  CONJUGATE  GRADIENT  ALGORITHM 


This  program  solves  the  necessary  conditions  of 
the  optimization  problem  in  Chapter  3«  A  conjugate 
gradient  algorithm,  ZXCGR,  is  used  frcm  the  IMSL 
library  [6].  An  explanation  of  the  algorithm  can  be 
found  in  [8]. 

This  program  first  reads  in  the  system  information, 
then  calculates  an  approximating  discrete  distribution 
to  the  uniform  distribution.  Then,  after  being  given 
the  Initial  values  of  P,  the  program  calls  the  gradient 
routine,  ZXCGR,  which  calls  JTP  for  the  values  of  JAY 
and  its  gradients,  DHAM. 


DOUBLE  PRECISION  S(20 ,20) ,G( 10, 10) ,N(20 ,20) ,H( 10 , 1 0) ,L( 10 , 10) , 
A  V(20,2Q) , W(20 ,20) ,WU( 10 , 1 0)  , 

A  PT, T(2, 1000) , 

A  P(10),DHAM(10),WK(60), 

A  ACC, MAXFN, DFPRED, J AT, AT, BT, INC, VAL , TEM, E 

INTEGER  NX, NU, NY, NZ , NW, NV, NDIST, 

A  NFT, NGT, NKT, NHT, 

A  FT (5, 3) »GT(5i3) »KT(5»3) *HT(5,3) , 

A  NP,IER,I,J 

COMMON  S(20,20),G(10,10),N(20,20),H(10,10),L(10,10), 

A  V(20,20) ,W( 20,20) ,WU( 10,10), 

A  PT, T(2,1000) , 

A  NX, NU, NY, NZ ,NU, NV , NDIST, 

A  NFT,  NGT,  NKT,  NHT, 

A  FT(5,3),GT(5,3),KT(5,3),HT(5,3) 

EXTERNAL  JTP 

This  section  reads  in  the  system  matrices,  their 
dimensions,  and  the  location  of  the  unknown  parameters 
in  the  system  matrices. 

These  are  the  dimensions  of  the  system  matrices. 

READ(5,f)NX,NU, NY, NZ,NW,NV, NDIST 
This  reads  the  entries  of  the  "F"  matrix. 

READ(5 , *) ((S(I,J),J=1 , NX), 1=1 , NX) 

This  reads  the  location  of  the  unknown  parameters  in  F. 


NFT-  number  of  parameters  in  F 
FT(I,1)-  row  location  of  parameter 
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FT(I,2)-  column  location  of  parameter 

FT (I, 3)-  entry  of  T  vector  to  be  put  in  this  location 

READ(5,*)NFT,((FT(I,J),Ja1,3),I=1,NFT) 

Entries  of  the  "G"  matrix. 

READ(5,»)((G(I, J) , J=1 , NU) ,1=1 ,NX) 

READ(5,#)NGT, ((GT(I,J),Ja1,3),I=1,NQT) 

Entries  of  the  "Kn  matrix 

READ (5 • •)  ( (N(I, J) , J=1 ,NW) ,1=1 ,NX) 

READ (5 , *)NKT, ( (KT(I , J) , J=1 ,3),I=1,NKT) 

Entries  of  the  "H"  matrix 

READ(5,«)((H(I,J),Ja1,NX),I=1,NX) 

READ(5, »)NHT, ( (HT(I, J) ,  J=1 ,3) ,1=1 ,NHT) 

The  following  read  the  entries  of  the  L,  Q,  R,  WX, 
and  UO  matrices,  respectively. 

READ(5, •) ((L(I, J) , J=1 ,NV) ,1=1 ,NX) 

READ(5,  •)  (  (V(I,  J)  ,  Jal , NW)  ,1=1  ,NVJ) 

READ (5 , •) ( (V (NW+I , NW+J) , Jal , NV ) , 1= 1 , NV ) 

READ (5, *)((W(I,J),Ja1 ,NX) , 1=1 ,NX) 
READ(5,»)((WU(X,J),Ja1,N0),Ia1,ND) 

This  section  calculates  an  approximating  discrete 
distribution  to  the  uniform  distribution  for  a 
one  dimensional  T  vector.  User  must  supply  any 
code  needed  for  higher  dimensional  T  vectors. 

This  reads  in  the  desired  interval  of  T  values  for 
the  uniform  distribution. 

READ(5, *)AT,BT 

This  calculates  the  discrete  distribution. 

IF(NDIST.EQ.1)G0T03 

E=0.0 

DO  1  1=1 , ND1ST-2 
E=E+I»»2 

TEM=DSQRT( 1 2.0»( 1 .0/NDIST- ,25+E/ (NDIST«(NDIST-1 ) »«2) ) ) 
INC=(BT-AT)/TEM/ (NDIST-1 ) 

VAL=(AT+BT)/2-(BT-AT)/2/TEM 

DO  2  Ial.NDIST 

T(1,I)=VAL 

VALaVAL+INC 

GOTO  4 

T(1,1)aAT 
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C  This  section  provides  the  initial  value  of  P  for 
C  the  local  gradient  algorithm,  ZXCGR,  and  also  some 
C  other  initial  values  the  routine  needs.  See  Reference 
C  [6]  for  details  on  the  usage  of  ZXCGR. 

C 

4  PT=1 .0/NDIST 
DO  5  1=1, NX 
DO  5  J=1 , NX 

5  W(I, J)=PT*W(I, J) 

NP=(NU+NY)*NZ 

READ(5,»)(P(I),I=1.NP) 

READ(5,*)DFPRED 

ACC=.1e-9 

MAXFN=20 

CALL  ZXCGR ( JTP, NP, ACC, MAXFN , DFPRED, P, DHAH, J AY, WK, IER ) 

ZXCGR  yields  the  stationary  values  of  P,  the  value  of  the 
cost,  JAY,  and  the  gradients,  DHAM,  at  those  values  of 
the  vector,  P. 

PRINT100, (P(I) ,1=1 ,NP) 

PRINT1 00, (DHAM(I) ,1=1 ,NP) 

30  F0RMAT(2X,5(E14.8,2X)/) 

PRINT*, JAY 

STOP 

END 


SUBROUTINE  JTP (NP, P, JAY, DHAM) 


This  subroutine  calculates  JAY,  and  its  gradients,  DHAM, 
which  is  the  information  needed  for  the  IMSL  conjugate 
gradient  algorithm,  ZXCGR  [6] . 


DOUBLE 

A 

A 

A 

A 

A 

A 

A 

A 


PRECISION  S(20 ,*20)  ,G(  10, 10) , N(20, 20)  , H(  10, 1 0)  ,L(  10,10) , 

V (20 ,20) , W(20 , 20) , WU( 10 , 10) , 

PT, T(2, 1000) , 

B(  10, 10), COO,  10)  ,EB(  10,10), DC(  10,10)  , 

DS(20,20) , DN(20 ,20) ,DW(20,20), 
COV(20,20),LAM(20,20), 

ALPHT(190,190),RESB(190,190)  ,RVCR  ( 1 90)  ,RVLR(  1 90) , 
RES(20,20),RESA(20,20),RVC(190) ,RVL(190), 

P(NP) ,DHAM(NP) , JAY, work(200) 


INTEGER  NX, NU,NY,NZ ,NW, NV , NDIST, 


A 

A 

A 

A 

COMMON 

A 

A 


NFT,NGT, NKT.NHT, 

FT(5,3),OT(5,3),KT(5,3),HT(5,3), 

NP,  NZT,  NS,  NN,  NVEC, 

I, J, IA, IB , IC, ID, IP,M, LM, KK, II, LL, JJ, IER 
S(20,20),G(10,10),N(20,20),H(10,10),L(10,10), 
V (20 ,20) , W(20 ,20) ,HU( 10, 1 0) , 

PT, T(2, 1000) , 


A 

A 

A 


NX,  NO ,  NY,  NZ ,  NW ,  NV ,  NDIST, 

NFT,  NGT,  NET,  NHT, 
FT(5,3),GT(5,3),KTC5,3),HT(5,3) 

Some  initializations  that  are  needed. 

NS=NX+NZ 

NN=NW+NV 

NVEC=NS*(NS+1 )/2 

NZT=NZ/2*2 

JAYsO 

DO  6  1=1, NP 
DHAM(I)=0 

This  section  puts  the  values  of  P(IP)  into  the 
appropriate  entries  of  A(P),  B(P),  and  C(P). 

IP=0 

DO  1  1=2, NZ, 2 
IP=IP+1 

S(NX+I,NX+I-1)=P(IP) 

IP=IP+1 

S(NX+I,NX+I)=P(IP) 

S(NX+I-1,NX+I)=1 

IF(NZT.EQ.NZ)G0T02 

IP=IP+1 

S(NS,NS)=P(IP) 

DO  3  1=1, NZ 
DO  3  J=1 ,NY 
IP=IP+1 
B(I, J)=P(IP) 

DO  4  1=2, NZ ,2 
C(1,I)=1 

IF(NZT.NE.NZ )C( 1 , NZ ) =1 

DO  5  1=2, NO 

DO  5  J=1  ,NZ 

IP=IP+1 

C(I,J)=P(IP) 

This  loop  sums  JAY  and  DHAM  for  all  the  values  of  T 
in  the  distribution  that  was  passed  from  the  main 
program. 

DO  1000  ID=1, NDIST 

This  section  puts  the  values  of  T  into  the  appropriate 
entries  of  F,  G,  X,  and  H,  according  to  the  positions 
indicated  by  FT,  GT,  XT,  HT,  respectively. 

DO  10  1=1, NFT 

S(FT(I,1),FT(I,2))=T(FT(I,3),ID) 

DO  11  1=1, NGT 

G(GT(I,1),GT(I,2))=T(GT(I,3),ID) 
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DO  12  1=1 , NKT 

12  N(KT(I, 1 ) ,KT(I,2))=T(KT(I,3) ,  ID) 

DO  13  1=1, NHT 

13  H(HT(I,1),HT(I,2))=T(HT(I,3),ID) 

C 

C  This  section  calculates  S,  W,  and  NVN'. 

C  These  are  needed  to  solve  the  Lyapunov  equations, 

C  (24),  and  (27). 

C 

CALL  VMULPF (G , C , NX, NU , NZ , 1 0 , 1 0 , RES , 20 , IER ) 

DO  20  1=1, NX 
DO  20  J=1 ,NZ 

20  S(I,NX+J)=RES(I,J) 

CALL  VMULFF(B , H, NZ , NY, NX, 1 0 , 1 0 , RES , 20 , IER ) 

DO  21  1=1, NZ 
DO  21  J=1 , NX 

21  S(NX+I,J)=RES(I,J) 

CALL  TMULFF(B,L, NZ , NY, NV , 10, 10, RES, 20 , IER) 

DO  22  1=1 ,NZ 
DO  22  J=1,NV 

22  N(NX+I,NW+J)=RES(I,J) 

CALL  VMULFM(C , WU, NU , NZ , NU , 1 0 , 1 0 , RES A, 20 , IER ) 

CALL  VMULFF(RESA, C, NZ , NU, NZ ,20, 10,RES, 20 ,IER) 

DO  23  1=1, NZ 
DO  23  J=1,NZ 

23  W(NX+I,NX+J)=PT*RES(I, J) 

CALL  VMULFF(N, V. NS , NN, NN, 20 , 20 , RESA, 20 , IER ) 

CALL  VMULFP (RES A, N, NS , NN, NS , 20 , 20 , RES , 20 , IER ) 

C 

C  This  section  rearranges  the  entries  of  S  Into  a  matrix, 

C  ALPHT,  so  that  the  equations,  (24)  and  (27),  are  of  the 
C  form,  "Ax=BB,  which  is  suitable  for  inversion.  See 
C  Reference  [ 4] . 

C 

M=0 

DO  70  11=1, NS 
DO  70  JJ=II , NS 
M=M+1 

IF (II . EQ . J J )GQTQ7  8 
RV  C (M ) =-RES ( II , JJ ) 

RVL(M)=-W(II,JJ) 

G0T079 

78  RVC(H)*-RES(II,JJ)/2 
RVL(M)=-W(II,JJ)/2 

79  LM*0 

DO  70  RK=1 , NS 
DO  70  LL=KK, NS 
LM=LM*1 

IF ( KK. EQ . II . AND . LL . NE. JJ )G0T07 1 
IF ( KK . NE . II . AND . LL . EQ . J J ) G0T07 2 

IF (KK. NE.II. AND. LL.NE.JJ. AND.KK.EQ.JJ. AND.LL.NE.il )G0T073 
IF(KK.NE.II.AND.LL.NE.JJ.AND.KK.NE.JJ.AND.LL.EQ.II)G0T074 
IF(KK. NE.II. AND. LL.NE.JJ. AND. KK.NE.JJ. AND. LL.NE.II)G0T075 


IF(KK.EQ.II.AND.LL.EQ.JJ.AND.KK.EQ.JJ.AND.LL.EQ.II)G0T076 
IF(KK.EQ.II. AND. LL.EQ.JJ. AND. KK.NE.JJ. AND.LL.NE.il )G0T077 
ALPHT(M,LM)=S(LL,JJ) 

GOTO70 

ALPHT(M, LM)=S(KX, II) 

GOT070 

ALPHT(M,LH)=S(LL,  II) 

G0T070 

ALPHT(M,LM)=S(KK,JJ) 

GOT070 

ALPHT(M,LM)=0 

G0T070 

ALPHT(M,LM)sS(KK,II) 

GOTO70 

AL  PH  T  ( M ,  LM  )  sS  (  KK ,  II )  *S  (  LL ,  J  J  ) 

CONTINUE 

This  subroutine  inverts  ALPHT  so  that  COV  and  LAM 
can  be  calculated.  A  warning  -  LINV2F  often  will 
not  work  for  smaller  sizes  of  ALPHT  (dimension  3X3) • 

CALL  LINV2F (ALPHT, NVEC, 1 90 ,RESB , 12, WORK, IER) 

This  section  calculates  COT  and  LAM  using  the  inverted 
ALPHT. 

CALL  VMULFF(RESB , RVL, NVEC, NVEC, 1 , 1 90 , 1 90 , HVLfi, 1  90  ,IER) 
CALL  VMULPM(RESB , RVC, NVEC, NVEC, 1 , 1 90 , 1 90 , HVCH , 1 90 , IER ) 
IP=0 

DO  31  1=1, NS 
DO  31  J=I,NS 
IP=IP+1 

COV (I, J)=RVCR(IP) 

COV  ( J ,  I)=RVCR  (IP) 

LAM(I,J)=RVLR(IP) 

LAM(J, I)=RVLR(IP) 

This  section  calculates  JA7=tr[W*C0V] . 

CALL  VMULFF (W, COV , NS , NS , NS , 20 , 20 , RES , 20 , IER ) 

DO  32  Is 1 , NS 
JAYsJAY+RES(I,I) 

This  section  calculates  the  gradients  of  JAY,  DHAM, 
for  the  P's  that  are  entries  of  the  "AtP)”  matrix. 

For  the  "A"  parameters- 

DHAMstr{ [LAM*DS+DS ' »LAM]«C0V) 

IP=0 

DO  MO  1=2, NZ, 2 
IP=IP+1 

DS(NX+I, NX+I-1 ) si 


'  ■’  .  .V/  '  va’;-  . 
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CALL  VMULFF(DS ,  COV ,  NS ,  NS,  NS ,  20 , 20 , HESA, 20 , IER ) 
CALL  VMULFF(RES A, LAM, NS , NS , NS , 20 , 20 , RES , 20 , IER ) 
DO  41  IA=  1 ,  NS 

DHAM(IP)=DHAM(IP)+2*RES(IA, IA) 

DS ( NX+I , NX+I- 1 ) =0 
IP=IP+1 

DS (NX+I, NX+I )=1 

CALL  VMULFF (DS , COV , NS , NS , NS , 20 , 20 , RES A, 20 , IER ) 
CALL  VMULFF (RES A, LAM, NS , NS , NS , 20 , 20 , RES , 20 , IER ) 
DO  42  IA= 1 , NS 

DHAM(IP)=DHAM(IP)+2«RES(IA,IA) 

DS( NX+I, NX+I )=0 
IF (NZ  T. EQ. NZ )G0T044 
IP=IP+1 
DS(NS,NS)=1 

CALL  VMULFF(DS , COV , NS , NS , NS , 20 , 20 , RESA, 20 , IER ) 
CALL  VMULFF (RESA, LAM, NS , NS , NS , 20 , 20 , RES , 20 , IER ) 
DO  43  IA=1,NS 

DHAM(IP)=DHAM(IP)+2*RES(IA,IA) 

DS  ( NS ,  NS )  =0 

This  section  calculates  the  gradients  of  JAY,  DHAM, 
for  the  P’s  that  are  entries  of  the  "B(P)"  matrix. 
For  the  "B"  parameters  - 

DHAM=tr{ [LAM*DS+DS ’ *LAM]*C0V } 

+tr{LAM*[DN*V »N ’ +N*V  *DN • ] } 

DO  50  1=1, NZ 
DO  50  IA=1,NY 
IP=IP+1 
DB  (I ,  IA)  =  1 

CALL  VMULFF(DB,L, NZ ,NY,NV, 10, 10, RES, 20, IER ) 

DO  53  IB=1  ,NZ 
DO  53  IC=1 ,NV 

DN  ( NX+IB ,  NW+IC ) =RES  ( IB ,  IC ) 

CALL  VMULFF(DN , V , NS , NN, NN, 20 , 20 , RESA, 20 , IER ) 

CALL  VMULFP (RESA, N, NS , NN, NS , 20 , 20 , RES , 20 , IER ) 
CALL  VMULFF(N, V, NS, NN, NN, 20,20, RESB , 1 90 , IER ) 

CALL  VMULFP (RESB , DN , NS , NN , NS , 1 90 , 20 , RES A, 20 , IER ) 
DO  54  IB=1  ,NS 
DO  54  IC=1 , NS 

RES ( IB , IC ) =RES ( IB , IC ) +RES A (IB , IC ) 

CALL  VMULFF (LAM, RES , NS , NS , NS , 20 , 20 , RES A , 20 , IER ) 
DO  55  3B=1,NS 

DHAM(IP)=DHAM(IP)+RESA(IB ,  IB ) 

CALL  VMULFF (DB , H , NZ , NY , NX , 1 0 , 1 0 , RES , 20 , IER } 

DO  56  IB=1 ,  NZ 
DO  56  IC=1,NX 
DS ( NX+IB , IC ) =RES ( IB , IC ) 

CALL  VMULFF (DS , COV , NS , NS , NS , 20 , 20 , RESA, 20 , IER ) 
CALL  VMULFF (R  ES  A, LAM , NS , NS , NS , 20 , 20 , RES , 20 , IER ) 
DO  57  IB=1,NS 


•»>  .‘r  . 


'  V- -WVh' 


57  DHAM(IP)sDHAM(IP)+2*RES(IB ,  IB) 

DO  58  IB=1  ,NZ 

DO  59  IC=1,NV 

59  DN ( NX+IB , NW+IC ) sO 
DO  58  IC=1 1 NX 

58  DS ( NX+IB, IC)=0 

50  DB(I,IA)=0 

C 

C  This  section  calculates  the  gradients  of  JAY,  DHAM, 

C  for  the  P's  that  are  entries  of  the  ”C(P)"  matrix. 

C  For  the  "C"  parameters  - 
C 

C  DHAM= tr { [DW+LAM*DS+DS • *L AM]  *C0V } 

C 

DO  60  I=2,NU 
DO  60  J=1 ,NZ 
IP=IP+1 
DC(I,J)a1 

CALL  VMDLFM(DC, WO,NO, NZ , NO, 10, 10, RES A, 20, 1ER) 

CALL  VMOLFF (RESA, C, NZ , NO , NZ , 20 , 1 0 , RES , 20 , IER ) 

DO  61  IAs 1 , NZ 
DO  61  XBzI.NZ 

61  DW  ( NX+IA, NX+IB ) =RES ( I A, IB ) 

CALL  VM0LFM(C, WO, NO,NZ ,NO, 10, 10, RESA, 20, IER) 

CALL  VMOLFF (RES A, DC , NZ , NO , NZ , 20 , 1 0 , RES , 20 , IER ) 

DO  67  IA=1 ,NZ 
DO  67  IB=1  ,NZ 

67  DW  (NX+IA, NX+IB ) =DW (NX+IA, NX+IB ) +RES (IA,  IB ) 

CALL  VMOLFF (DW , COV , NS , NS , NS , 20 , 20 , RES , 20 , IER ) 

DO  62  IAsl.NS 

62  DHAM(IP)=DHAM(IP}+PT*RES(IA, IA) 

DO  63  IA=1 ,NZ 

DO  63  IB=1  ,NZ 

63  DW( NX+IA, NX+IB  )=0 

CALL  VM0LFF(G , DC, NX, NO , NZ , 1 0 , 1 0 , RES , 20 , IER ) 

DO  64  IAsl.NX  , 

DO  64  IB  si  ,1 G 

64  DS  (IA,  NX+IB  )sRES(IA,  IB) 

CALL  VM0LFF(DS , COV , NS , NS , NS , 20 , 20 , RESA, 20 , IER ) 

CALL  VMOLFF(RESA, LAM, NS , NS , NS , 20 , 20 , RES , 20 , IER ) 

DO  65  IAsI , NS 

65  DHAM(IP)=DHAM(IP)+2*RES(IA, IA) 

DO  66  IAsl.NX 

DO  66  IB=1 ,  NZ 

66  DS(IA,NX+IB)sO 

60  DC(I,J)=0 
C 

C  Continue  with  another  value  of  T  from  the  distribution 
C  provided  by  the  main  program. 

1000  CONTINOE 
RETORN 
END 
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